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NOGO RECEPTOR HOMOLOGS 
FIELD OF THE INVENTION 

5 The invention relates to neurology and molecular biology. More particularly, 

the invention relates to CNS neurons and axonal growth 

BACKGROUND 

Among the mechanisms through which the cells of an organism communicate 
10 with each other and obtain information and stimuli from their environment is through 
cell membrane receptor molecules expressed on the cell surface. Many such receptors 
have been identified, characterized, and sometimes classified into major receptor 
superfamilies based on structural motifs and signal transduction features. The receptors 
are a first essential link for translating an extracellular signal into a cellular 
15 physiological response. 

Receptors on neurons are particularly important in the development of the 
nervous system during embryogenesis. The neurons form connections with target cells 
during development through axonal extension of the neurons toward the target cells in 
a receptor-mediated process. Axons and dendrites have a specialized region of their 
20 distal tips known as the growth cone. Growth cones enable the neuron to sense the 

local environment through a receptor-mediated process and direct the movement of the 
axon or dendrite of the neuron toward the neuron's target cell. This process is known 
as elongation. Growth cones can be sensitive to several guidance cues, for example, 
surface adhesiveness, growth factors, neurotransmitters and electric fields. The 
25 guidance of growth at the cone depends on various classes of adhesion molecules, 

intercellular signals, as well as factors that stimulate and inhibit growth cones. 

Interestingly, damaged neurons do not elongate in the central nervous system 
(CNS) following injury due to trauma or disease, whereas axons in the peripheral 
nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to 
30 elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS 
environment that is not permissive for axonal elongation. Classical grafting 
experiments by Aguayo and colleagues (e.g., Richardson et al, (1980) Nature 284, 
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264-265) demonstrated that CNS axons can in fact elongate over substantial distances 
within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation. 
Therefore, given the appropriate environment, CNS axons can regenerate, implying 
that CNS axonal injury can potentially be addressed by appropriate manipulation of the 
5 CNS environment. 

The absence of axon regeneration following injury can be attributed to the 
presence of axon growth inhibitors. These inhibitors are predominantly associated 
with myelin and constitute an important barrier to regeneration. Axon growth 
inhibitors are present in CNS-derived myelin and the plasma membrane of 
10 oligodendrocytes that synthesize myelin in the CNS (Schwab et al., (1993) Annu. Rev. 
Neurosci. 16, 565-595).' Myelin-associated inhibitors appear to be a primary 
contributor to the failure of CNS axon regeneration in vivo after an interruption of 
axonal continuity, whereas other non-myelin associated axon growth inhibitors in the 
CNS may play a lesser role. These inhibitors block axonal regeneration following 
15 neuronal injury due to trauma, stroke or viral infection. 

Numerous myelin-derived axon growth inhibitors have been characterized (see, 
for review, David et al., (1999) W0995394547; Bandman et al., (1999) U.S. Patent 
No. 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components 
of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein 
20 (MAG), which have inhibitory activity for axonal extension, have been described as 
well (Schwab et al., (1990) W09005191; Schwab et al., (1997) U.S. Patent No. 
5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor 
that was originally characterized based on the effects of the purified protein in vitro 
and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp. 

25 Neurol. 109,2-5), The Nogo cDNA was first identified through random analysis of 
brain cDNA and had no suggested function (Nagase et al., (1998) DNA Res. 5, 
355-364). The identification of this Nogo cDNA as the cDNA encoding the 250 kDa 
myelin-associated axon growth inhibitor was discovered only recently (GrandPre et al. , 
(2000) Nature 403, 439-444; Chen et al., (2000) Nature 403, 434-439; Prinjha at al., 

3 0 (2000) Nature 403 , 3 83 -3 84) . 

Importantly, Nogo has been shown to be the primary component of CNS 
myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective 
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expression by oligodendrocytes and not by Schwann cells (the cells that myelinate P.S. 
axons) is consistent with the inhibitory effects of CNS myelin, in contrast to P.S. 
myelin (GrandPre et al. , (2000) Nature 403, 434-439). In culture, Nogo inhibits 
axonal elongation and causes growth cone collapse (Spillmann et al., (1998) J. Biol. 

5 Chem. 272, 19283-19293). Antibodies (e.g., IN-1) against Nogo have been shown to 
block most of the inhibitory action of CNS myelin on neurite growth in vitro 
(Spillmann etal., (1998) J. Biol. Chem. 272:19283-19293). These experiments 
indicate that Nogo is the main component of CNS myelin responsible for inhibition of 
axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown 
10 to enhance axonal regeneration after spinal cord injury, resulting in recovery of 

behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature 
343, 269-272; Bregman et al., (1995) Nature 378, 498-501). Thus, there is 
substantial evidence that Nogo is a disease-relevant molecular target. Agents that 
interfere with the binding of Nogo to its receptor would be expected to improve axonal 
15 regeneration in clinical states in which axons have been damaged, and improve patient 
outcome. 

Modulation of Nogo has been described as a means for treatment of 
regeneration for neurons damaged by trauma, infarction and degenerative disorders of 
the CNS (Schwab et al., (1994) W09417831; Tatagiba et al, (1997) Neurosurgery 
20 40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et 

al., (1993) U.S. Patent No. 5,250,414); Schwab etal., (2000) U.S. Patent No. 
6,025,333). 

Antibodies which recognize Nogo have been suggested to be useful in the 
diagnosis and treatment of nerve damage resulting from trauma, infarction and 
25 degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272; 
Schwab et al., (1997) U.S. Patent No. 5,684,133). For CNS axons, there is a 
correlation between the presence of myelin and the inhibition of axon regeneration 
over long distances (Savio and Schwab (1990) Proc. Natl. Acad. Sci. 87, 4130-4133; 
Keirstead et al., (1992) Proc. Natl. Acad. Sci. 89, 11664-11668). After Nogo is 
30 blocked by antibodies, neurons can again extend across lesions caused by nerve 
damage (Schnell and Schwab (1990) Nature 343, 269-272). 
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SUMMARY OF THE INVENTION 

Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgRl) in 
mice and humans have been discovered. Various domains in the polypeptides encoded 
by the NgR2 and NgR3 genes have been identified and compared to domains in mouse 
5 and human NgRl polypeptides. This comparison has led to identification of a 

consensus sequence (NgR consensus sequence) that characterizes a family of proteins 
(NgR family). Based on these and other discoveries, the invention features molecules 
and methods for modulating axonal growth in CNS neurons. 

The invention provides a polypeptide that contains a polypeptide containing a 
10 tryptophan rich LRRCT domain consisting of the amino acid sequence: 

N X x W X 2 C X 3 C R A R X 4 L W X 5 W X 6 X 7 X 8 Xg R X 10 S S S X n V 


X 12 C X 13 X 14 P X 15 X 16 X 17 X lg X 19 X 20 D L X 21 X 22 L X 23 X 24 X 25 D 
15 

X 26 X 27 X 28 C [SEQ ID NO: 19] 

wherein X is any protein amino acid or a gap, and the polypeptide does not 
include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
20 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). 


25 


30 


Preferably, X17 and X23 are (independently) arginine or lysine. In some 
embodiments, the amino acid sequence of the LRRCT domain is residues 261-310 of 
SEQ ID NO:2, or residues 261-310 of SEQ ID NO: 2 with up to 10 conservative 
amino acid substitutions. In some embodiments, the polypeptide contains the 
following NTLRRCT amino acid sequence: 


C P X 3 X 2 C X 3 C Y X 4 X 5 P X 6 X 7 T X 8 S C X 9 X 10 X u X 12 X 13 X 14 X 15 X 16 P 
X 17 X 18 X 19 P X 20 X 21 X 22 X 23 R X 24 F L X 25 X 26 N X 27 I X 28 X 29 X 30 X 31 X 32 X 33 
X 34 F X 35 X 36 X 37 X 38 X 39 X 40 X 41 X 42 L W X 43 X 44 S N X 45 X 46 X 47 X 48 1 X 49 
X 50 X 51 X 52 F X 53 X 54 X 55 X 56 X 57 L E X 58 L D L X 59 D N X 60 X 61 L X 62 X 63 X 64 
X 65 P X 66 T F X 67 G L X 68 X 69 L X 70 X 71 L X 72 L X 73 X 74 C X 75 L X 76 X 77 L X 78 
X 79 X 80 X 81 F X 82 GL X 83 X 84 L Q Y L YL Q X 85 N X 86 X 87 X 88 X S9 L X 90 D 
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X 91 X 92 F X 93 D L X 94 N L X 95 H L F L H G N X 96 X 97 X 98 X 99 X 100 X 101 X 102 
X 103 X 104 F R G L X 105 X 106 L D R L L L H X 107 N X 108 X 109 X 110 X 1U V H X n2 
X 113 A F X 114 X 115 L X 116 R L X 117 X 118 L X 119 L F X 120 N X 121 L X 122 X 123 L 

X 124 X 125 x 126 x 127 l X 128 X 129 l x 130 x 131 l x 132 x 133 lrln x 134 n x 135 w 
X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
V X 147 C X 148 X 149 P X 150 X 151 X 152 X 153 X 154 X 155 D L X 156 X 157 L X 158 X 159 X 160 
D X 161 X 162 X 163 C [SEQ ID NO: 18] 


wherein X is any amino acid residue or a gap and wherein the polypeptide is not the 
10 polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). For 
example, X 6 , X 37 and X 3S may represent a gap. Specific examples of polypeptides of 
the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and 
SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a) 
a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial 
15 CTS domain, if present, consists of no more than the first 39 amino acids of the CTS 
domain. While the polypeptide may contain a functional GPI domain, a functional GPI 
domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of 
the invention optionally includes an amino acid sequence of a heterologous 
polypeptide, e.g., an Fc portion of an antibody. 

20 The invention also provides a nucleic acid encoding an above-described 

polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably 
linked to an expression control sequence; and a transformed host cell containing the 
vector. A method of producing a polypeptide of the invention is also provided. The 
method includes introducing a nucleic acid encoding the above-described polypeptide 
25 into a host cell, culturing the cell under conditions suitable for expression of the 
polypeptide, and recovering the polypeptide. 

The invention also provides an antisense molecule whose nucleotide sequence 
is complementary to a nucleotide sequence encoding a polypeptide selected from the 
group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID NO: 2, a 
30 polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide 

consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to 
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100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The 
invention also provides a nucleic acid encoding such an antisense molecule. 

The invention also provides an antibody that binds to an above-described 
polypeptide. Polypeptides or antibodies of the invention can be formulated into 
5 pharmaceutical compositions containing the polypeptide or antibody and a 
pharmaceutically acceptable carrier. 

The invention also provides a method for decreasing inhibition of axonal 
growth of a CNS neuron. The method includes the step of contacting the neuron with 
an effective amount of a polypeptide or antibody of the invention. 

10 The invention also provides a method for treating a central nervous system disease, 

disorder or injury. The method includes administering to a mammal, e.g., a human, an 
effective amount of a polypeptide or antibody of the invention. Exemplary diseases, 
disorders and injuries that may be treated using molecules and methods of the 
invention include, but are not limited to, cerebral injury, spinal cord injury, stroke, 

15 demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination, 
encephalomyelitis, multifocal leukoencephalopathy, panencephalitis, 
Marchiafava-Bignami disease. Spongy degeneration, Alexander's disease, Canavan's 
disease, metachromatic leukodystrophy and Rrabbe's disease. 

The invention also provides a method for identifying a molecule that binds a 
20 ' polypeptide of the invention. The method includes the steps of: (a) providing a 
polypeptide of the invention; (b) contacting the polypeptide with the candidate 
molecule; and (c) detecting binding of the candidate molecule to the polypeptide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which the 
25 invention belongs. In case of conflict, the present application, including definitions, 
will control. All publications, patent and other references mentioned herein are 
incorporated by reference. 

The materials, methods and examples presented below are illustrative only, and 
not intended to be limiting. Other features and advantages of the invention will be 
30 apparent from the detail description and from the claims. 


BRIEF DESCRIPTION OF THE FIGURES 
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Fig. 1 A-1B shows an alignment of NgR2 (SEQ ID NO:2) and NgR3 (SEQ ID 
NO: 4) with the known NgR, NgRl (SEQ ID NO: 5) and the Consensus Sequence 
(SEQ ID NO:6). 

5 

Fig. 2. mNgR3 does not bind hNogoA(1055-l 120). COS-7 cells were 
transfected with vectors encoding myc-NgRl or myc-NgR3, fixed, and stained with 
anti-myc antibodies or AP-hNogoA(1055-l 120). 

10 Fig.3. An alignment of the amino acid sequences of human NgRl, murine 

NgRl, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino 
acid #1 of murine NgR3. The consensus sequence is listed below. The LRR NT 
domain is indicated by a shaded box; domains LLR 1, LLR 3, LLR 5, and LLR 7 are 
indicated by open boxes; LLR 2, LLR 4, LLR 6 and LLR 8 are indicated by shaded 

15 boxes; and the LLR CT domain is indicated by a shaded box. Amino acids in bold in 
LLR 8 indicate a conserved glycosylation sites. A dot indicates conserved cystine 
residue in LRR4. Box at C terminus indicates putative GPI signals. 

20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides purified and isolated polynucleotides (e.g . , 
DNA sequences and RNA transcripts, both sense and complementary antisense 
strands, both single- and double-stranded, including splice variants thereof) encoding 
NgR homologs, referred to herein as NgR. Unless indicated otherwise, as used herein, 

25 the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid 
sequence, whereas the upper case version (NgR) refers to a protein, polypeptide, 
peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by 
number, e.g., "NgR2" is a human NgR homolog, "NgR3" is a murine-derived NgR 
homolog, and "NgRl" is the known NgR identified by Dr. Stephen Strittmatter. 

30 Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention 
include genomic DNA cDNA and DNA that has been chemically synthesized in whole 
or in part. 
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Standard reference works setting forth the general principles of recombinant 
DNA technology known to those of skill in the art include Ausubel et al., CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York (1998); 
Sambrook et al. , Molecular Cloning: A Laboratory Manual, 2d Ed., Cold 
5 Spring Harbor Laboratory Press, Plainview, New York (1989); Kaufman et al., Eds., 
Handbook of Molecular and Cellular Methods in Biology and Medicine, 
CRC Press, Boca Raton (1995); McPherson, Ed., Directed Mutagenesis: A 
Practical Approach, IRL Press, Oxford (1991). 

As used herein, the term "axon" refers to a long cellular protrusion from a 
10 neuron, whereby action potentials are conducted, either to or from the cell body. 

As used herein, the term "axonal growth" refers to an extension of the long 
process or axon, originating at the cell body and proceeded by the growth cone. 

As used herein, the term "central nervous system disorder" refers to any 
pathological state associated with abnormal function of the central nervous system 
15 (CNS). The term includes, but is not limited to, altered CNS function resulting from 

physical trauma to cerebral tissue, viral infection, autoimmune machanisms and genetic 
mutation. 

As used herein, the term "demyelinating disease" refers to a pathological 
disorder characterized by the degradation of the myelin sheath of the oligodendrocyte 
20 cell membrane. 

As used herein, the term "growth cone" refers to a specialized region at the tip 
of a growing neurite that is responsible for sensing the local environment and moving 
the axon toward its appropriate synaptic target cell. 

As used herein, the term "growth cone movement" refers to the extension or 
25 collapse of the growth cone toward a neuron's target cell. 

As used herein, the term "neurite" refers to a process growing out of a neuron. 
As it is sometimes difficult to distinguish a dendrite from in axon in culture, the term 
"neurite" is used for both. 

As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the 
30 CNS whose function is to myelinate CNS axons. 

"Synthesized" as used herein and understood in the art, refers to 
polynucleotides produced by purely chemical, as opposed to enzymatic, methods. 
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"Wholly" synthesized DNA sequences are therefore produced entirely by chemical 
means, and "partially" synthesized DNAs embrace those wherein only portions of the 
resulting DNA were produced by chemical means. By the term "region" is meant a 
physically contiguous portion of the primary structure of a biomolecule. In the case of 
5 proteins, a region is defined by a contiguous portion of the amino acid sequence of that 
protein. The term "domain" is herein defined as referring to a structural part of a 
biomolecule that contributes to a known or suspected function of the biomolecule. 
Domains may be co-extensive with regions or portions thereof; domains may also 
incorporate a portion of a biomolecule that is distinct from a particular region, in 
10 addition to all or part of that region. Examples of NgR protein domains include, but 
are not limited to, the signal peptide, extracellular (i.e., N-terminal) domain, and 
leucine-rich repeat domains. 

As used herein, the term "activity" refers to a variety of measurable indicia 
suggesting or revealing binding, either direct or indirect; affecting a response, i.e., 

15 having a measurable affect in response to some exposure or stimulus, including, for 
example, the affinity of a compound for directly binding a polypeptide or 
polynucleotide of the invention, or, for example, measurement of amounts of upstream 
or downstream proteins or other similar functions after some stimulus or event. Such 
activities may be measured by assays such as competitive inhibition of NgRl binding to 
20 Nogo assays wherein, for example, unlabeled, soluble NgR2 is added to an assay 

system in increasing concentrations to inhibit the binding of Nogo to NgRl expressed 
on the surface of CHO cells. As another example, one may assess the ability of 
neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab 
(1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2 
25 and/or NgR3 as a biological indicator of NgR function. 

As used herein, the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab, Fab', F(ab)2, and other fragments thereof. Complete, intact 
antibodies include monoclonal antibodies such as murine monoclonal antibodies, 
chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and 
30 humanized antibodies. 

As used herein, the term "binding" means the physical or chemical interaction 
between two proteins or compounds or associated proteins or compounds or 
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combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der 
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be 
either direct or indirect, indirect being through or due to the effects of another protein 
or compound. Direct binding refers to interactions that do not take place through or 
5 due to the effect of another protein or compound but instead are without other 
substantial chemical intermediates. 

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecules, peptides, proteins, sugars, 
nucleotides or nucleic acids, and such compound can be natural or synthetic. 

10 As used herein, the term "complementary" refers to Watson-Crick basepairing 

between nucleotide units of a nucleic acid molecule. 

As used herein, the term "contacting" means bringing together, either directly 
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of 
the invention. The polypeptide or polynucleotide can be in any number of buffers, 

15 salts, solutions etc. Contacting includes, for example, placing the compound into a 

beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the 
like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or 
fragment thereof. 

As used herein, the phrase "homologous nucleotide sequence," or "homologous 
20 amino acid sequence," or variations thereof, refers to sequences characterized by an 
identity at the nucleotide level, or a homology at the amino acid level, of at least the 
specified percentage. Homologous nucleotide sequences include those sequences 
coding for isoforms of proteins. Such isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. 

25 Alternatively, isoforms can be encoded by different genes. Homologous nucleotide 

sequences include nucleotide sequences encoding for a protein of a species other than 
humans, including, but not limited to, mammals. Homologous nucleotide sequences 
also include, but are not limited to, naturally occurring allelic variations and mutations 
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does 
30 not, however, include the nucleotide sequence encoding NgRl. Homologous amino 
acid sequences include those amino acid sequences which contain conservative amino 
acid substitutions and which polypeptides have the same binding and/or activity. A 
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homologous amino acid sequence does not, however, include the amino acid sequence 
encoding other known NgRs. Percent homology can be determined by, for example, 
the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 

Genetics Computer Group, University Research Park, Madison WI), using the default 
5 settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 
482-489, which is incorporated herein by reference in its entirety). 

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other 
proteins with which it is associated in nature, i.e., a nucleic acid that has been removed 
10 from its native environment. Examples of isolated nucleic acid molecules include, but 
are not limited to, recombinant DNA molecules contained in a vector, recombinant 
DNA molecules maintained in a heterologous host cell, partially or substantially 
purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an 
"isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 
15 sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated NgR nucleic acid molecule can contain less than about 50 
kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0. 1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 
20 nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a 

cDNA molecule, can be substantially free of other cellular material or culture medium 
when produced by recombinant techniques, or of chemical precursors or other 
chemicals when chemically synthesized. 

As used herein, the term "heterologous" refers to a nucleotide or amino acid 
25 sequence that is a different, or non-corresponding sequence, or a sequence derived 

from a different species. For example, a mouse NgR nucleotide or amino acid sequence 
is heterologous to a human NgR nucleotide or amino acid sequence, and a human NgR 
nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide 
or amino acid sequence. 

30 As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does 

not anchor itself in a membrane. Such soluble polypeptides include, for example, 

NgR2 and NgR3 polypeptides that lack a sufficient portion of their GPI anchor signal 
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to anchor the polypeptide or are modified such that the GPI anchor signal is not 
adequate to result in replacement of the peptide with a GPI anchor. In preferred 
embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of 
NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR 
5 polypeptides include full-length or truncated (e.g., with internal deletions) NgR. 

Soluble NgR polypeptides may include the entire NgR protein up to the 
putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2, 
and from amino acid 1 to about amino acid 438 ofNgR3). In other embodiments, the 
signal peptide of the proteins may be removed or truncated (e.g , all or part of the 
10 signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID 
NO:2, may be removed; all or part of the signal sequence of NgR3, which spans amino 
acid 1 to about amino acid 40 of SEQ ID NO:4, may be removed). In some 
embodiments, the mature NgR2 (SEQ ID NO: 8) and the mature NgR3 (SEQ ID 
NO: 9) are used. 

15 Soluble NgR polypeptides include at least one of the putative ligand-binding 

portions of NgR, including the first cysteine-rich region (SEQ ID NO: 10, the leucine 
repeat region (SEQ ID NO: 12) and the second cysteine-rich region (SEQ ID NO: 1 1). 
In some embodiments, soluble NgR polypeptides consist of amino acid 1 through 
about amino acid 395 of SEQ ID NO:2, or amino acid 1 through about amino acid 
20 438 of SEQ ID NO:4. 

In other embodiments, the soluble NgR polypeptides are fusion proteins that 
contain amino acids 30 through about amino acid 395 of mature NgR2 or amino acid 
40 through about amino acid 438 ofNgR3, the C-terminal 10 amino acids of a human 
IgG 1 hinge region containing the two cysteine residues thought to participate in 
25 interchain disulfide bonding, and the CH2 and CH3 regions of a human IgGI heavy 
chain constant domain. This type of recombinant protein is designed to modulate 
inhibition of axonal elongation through inhibition of the Nogo ligand binding to NgRl, 
or by inhibiting the ligand of the NgR from interacting with cell surface NgR. The 
NgR portion of the fusion binds to the Nogo ligand and the IgGI portion binds to the 
30 FcyRI (macrophage) and Fcylll (NK cells and neutrophils) receptors. 

The production of the soluble polypeptides useful in this invention may be 
achieved by a variety of methods known in the art. For example, the polypeptides may 
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be derived from intact transmembrane NgR molecules by proteolysis using specific 
endopeptidases in combination with exopeptidases, Edman degradation, or both. The 
intact NgR molecule, in turn, may be purified from its natural source using 
conventional methods. Alternatively, the intact NgR may be produced by known 
5 recombinant DNA techniques using cDNAs, expression vectors and well-known 
techniques for recombinant gene expression. 

Preferably, the soluble polypeptides useful in the present invention are 
produced directly, thus eliminating the need for an entire NgR as a starting material. 
This may be achieved by conventional chemical synthesis techniques or by well-known 
10 recombinant DNA techniques wherein only those DNA sequences which encode the 
desired peptides are expressed in transformed hosts. For example, a gene which 
encodes the desired soluble NgR polypeptide may be synthesized by chemical means 
using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the 
amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences 
15 coding for the desired peptide also can be derived from the full- length DNA sequence 
by isolation of specific restriction endonuclease fragments or by PCR synthesis of the 
specified region from cDNA. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NOs: 1, 3 or a complement of either of 
20 these nucleotide sequences, can be isolated using standard molecular biology 

techniques and the sequence information provided herein. Using all or a portion of the 
nucleic acid sequences of SEQ ID NOs: 1 or 3 as a hybridization probe, NgR nucleic 
acid sequences can be isolated using standard hybridization and cloning techniques 
{e.g., as described in Sambrook et al., eds.. Molecular Cloning: a Laboratory 
25 MANUAL 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 

1989; and Ausubel, etal., eds., Current Protocols in Molecular Biology, John 
Wiley & Sons, New York, NY, 1993). 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
30 according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
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Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the terms "modulates" or "modifies" means an increase or 
decrease in the amount, quality, or effect of a particular activity or protein. 

5 As used herein, the term "oligonucleotide" refers to a series of linked 

nucleotide residues which has a sufficient number of bases to be used in a polymerase 
chain reaction (PCR). This short sequence is based on (or designed from) a genomic 
or cDNA sequence and is used to amplify, confirm or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 

10 Oligonucleotides comprise portions of a DNA sequence having at least about 10 

nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 
nucleotides. They are chemically synthesized and may be used as probes. 

As used herein, the term "probe" refers to nucleic acid sequences of variable 
length, preferably between at least about 10 and as many as about 6,000 nucleotides, 

15 depending on use. They are used in the detection of identical, similar or 

complementary nucleic acid sequences. Longer length probes are usually obtained from 
a natural or recombinant source, are highly specific and much slower to hybridize than 
oligomers. They may be single- or double-stranded and carefully designed to have 
specificity in PCR, hybridization membrane-based, or ELIS A-like technologies. 

20 The term "preventing" refers to decreasing the probability that an organism 

contracts or develops an abnormal condition. 

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors 
25 causing or contributing to the abnormal condition. A therapeutic effect relieves to 

some extent one or more of the symptoms of the abnormal condition. In reference to 
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of 
the following: (a) an increase in the proliferation, growth, and/or differentiation of 
cells; (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of 
30 degeneration; (d) relieving to some extent one or more of the symptoms associated 

with the abnormal condition; and (e) enhancing the function of the affected population 
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of cells. Compounds demonstrating efficacy against abnormal conditions can be 
identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
5 condition can relate to cell proliferation, cell differentiation, cell signaling, or cell 

survival. An abnormal condition may also include obesity, diabetic complications such 
as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty 
acid uptake and metabolism. 

Abnormal cell proliferative conditions, for example, include cancers such as 
10 fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound 
healing, psoriasis, diabetes mellitus and inflammation. 

Abnormal differentiation conditions include, for example, neurodegenerative 
disorders, slow wound healing rates and slow tissue grafting healing rates. 

Abnormal cell signaling conditions include, for example, psychiatric disorders 
15 involving excess neurotransmitter activity. 

Abnormal cell survival conditions may also relate to conditions in which 
programmed cell death (apoptosis) pathways are activated or abrogated. A number of 
protein kinases are associated with the apoptosis pathways. Aberrations in the 
function of any one of the protein kinases could lead to cell immortality or premature 
20 cell death. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated 
when the cells or tissues of the organism exist within the organism or outside of the 
organism. Cells existing outside the organism can be maintained or grown in cell 
25 culture dishes. For cells harbored within the organism, many techniques exist in the art 
to administer compounds, including (but not limited to) oral, parenteral, dermal, 
injection, and aerosol applications. For cells outside of the organism, multiple 
techniques exist in the art to administer the compounds, including (but not limited to) 
cell microinjection techniques, transformation techniques and carrier techniques. 

30 The abnormal condition can also be prevented or treated by administering a 

compound to a group of cells having an aberration in a signal transduction pathway to 
an organism. The effect of administering a compound on organism function can then 
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be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, 
more preferably a monkey or ape, and most preferably a human. 

By "amplification" it is meant increased numbers of DNA or RNA in a cell 
compared with normal cells. "Amplification" as it refers to RNA can be the detectable 
5 presence of RNA in cells, since in some normal cells there is no basal expression of 
RNA. In other normal cells, a basal level of expression exists, therefore in these cases 
amplification is the detection of at least 1 -2-fold, and preferably more, compared to 
the basal level. 

The amino acid sequences are presented in the amino to carboxy direction, 

10 from left to right. The amino and carboxy groups are not presented in the sequence. 
The nucleotide sequences are presented by single strand only, in the 5' to 3' direction, 
from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for 
amino acids) by three letters code. 

15 

Nucleic Acids 

Genomic DNA of the invention comprises the protein-coding region for a 
polypeptide of the invention and is also intended to include allelic variants thereof. It is 
widely understood that, for many genes, genomic DNA is transcribed into RNA 
20 transcripts that undergo one or more splicing events wherein intron (/. e . , non-coding 
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be 
spliced by alternative mechanisms, and therefore be subject to removal of different 
RNA sequences but still encode a NgR polypeptide, are referred to in the art as splice 
variants which are embraced by the invention. Splice variants comprehended by the 
25 invention therefore are encoded by the same original genomic DNA sequences but 
arise from distinct mRNA transcripts. Allelic variants are modified forms of a 
wild-type gene sequence, the modification resulting from recombination during 
chromosomal segregation or exposure to conditions which give rise to genetic 
mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as 
30 opposed to non-naturally occurring variants arising from in vitro manipulation). 

The invention also comprehends cDNA that is obtained through reverse 
transcription of an RNA polynucleotide encoding NgR (conventionally followed by 
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second-strand synthesis of a complementary strand to provide a double-stranded 
DNA). 

Preferred DNA sequences encoding a human NgR polypeptide is set out in 
SEQ ID NOs: 1 and 13. A preferred DNA of the invention comprises a double 
5 stranded molecule comprising the coding molecule (i. e . , the "coding strand") along 

with the complementary molecule (the "non-coding strand" or "complement") having a 
sequence unambiguously deducible from the coding strand according to Watson-Crick 
base-pairing rules for DNA. Also preferred are other polynucleotides encoding NgR 
polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog, 

10 NgR3. 

Also preferred are nucleotide sequences that encode at least a portion of a NgR 
polypeptide that has at least one biological function of a NgR. More preferred are 
nucleotide sequences that encode a portion of NgR that encodes at least the mature 
NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide 
15 sequences that encode the portion of NgR that encodes at least the ligand-binding 
region of NgR. 

The invention further embraces other species, preferably mammalian, homologs 
of the human NgR DNA. Species homologs, sometimes referred to as "orthologs," in 
general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at 
20 least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 98%, or at least 99% homology with human DNA of the invention. 
Generally, percent sequence "homology" with respect to polynucleotides of the 
invention may be calculated as the percentage of nucleotide bases in the candidate 
sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID 
25 NOs:l, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity. 

The polynucleotide sequence information provided by the invention makes 
possible large-scale expression of the encoded polypeptide by techniques well known 
and routinely practiced in the art. Polynucleotides of the invention also permit 
30 identification and isolation of polynucleotides encoding related NgR polypeptides, such 
as human allelic variants and species homologs, by well-known techniques including 
Southern and/or Northern hybridization, and polymerase chain reaction (PCR). 
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Examples of related polynucleotides include human and non-human genomic 
sequences, including allelic variants, as well as polynucleotides encoding polypeptides 
homologous to NgR and structurally related polypeptides sharing one or more 
biological, immunological, and/or physical properties of NgR. Non-human species 
5 genes encoding proteins homologous to NgR can also be identified by Southern and/or 
PCR analysis and are useful in animal models for NgR disorders. Knowledge of the 
sequence of a human NgR DNA also makes possible through use of Southern 
hybridization or polymerase chain reaction (PCR) the identification of genomic DNA 
sequences encoding NgR expression control regulatory sequences such as promoters, 
10 operators, enhancers, repressors, and the like. Polynucleotides of the invention are 
also useful in hybridization assays to detect the capacity of cells to express NgR. 
Polynucleotides of the invention may also provide a basis for diagnostic methods useful 
for identifying a genetic alteration(s) in a NgR locus that underlies a disease state or 
states, which information is useful both for diagnosis and for selection of therapeutic 
15 strategies. 

The disclosure herein of a full-length polynucleotide encoding a NgR 
polypeptide makes readily available to the worker of ordinary skill in the art every 
possible fragment of the full-length polynucleotide. The invention, therefore, provides 
fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at 
20 least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide 

encoding NgR. Preferably, fragments of polynucleotides of the invention comprise 
sequences unique to the NgR-encoding polynucleotide sequence, and therefore 
hybridize under highly stringent or moderately stringent conditions only (i. e . , 
"specifically") to polynucleotides encoding NgR (or fragments thereof). 

25 Polynucleotide fragments of genomic sequences of the invention comprise not only 
sequences unique to the coding region, but also include fragments of the full-length 
sequence derived from introns, regulatory regions, and/or other non-translated 
sequences. Sequences unique to polynucleotides of the invention are recognizable 
through sequence comparison to other known polynucleotides, and can be identified 
30 through use of alignment programs routinely utilized in the art, e.g., those made 

available in public sequence databases. Such sequences also are recognizable from 
Southern hybridization analyses to determine the number of fragments of genomic 
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DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be 
labeled in a manner that permits their detection, including radioactive, fluorescent and 
enzymatic labeling. 

Fragments of polynucleotides are particularly useful as probes for detection of 
5 full-length or fragment of NgR polynucleotides. One or more polynucleotides can be 
included in kits that are used to detect the presence of a polynucleotide encoding NgR, 
or used to detect variations in a polynucleotide sequence encoding NgR. 

The invention also embraces DNAs encoding NgR polypeptides that hybridize 
under moderately stringent or high stringency conditions to the noncoding strand, or 
10 complement, of the polynucleotide in any of SEQ ID NOs: 1 or 3 . 

Stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 

6.3. 176.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain 
15 hybridized to each other. Anon-limiting example of stringent hybridization conditions 
is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 
mM EDTA 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon 
sperm DNA at 65 °C. This hybridization is followed by one or more washes in 0.2X 
SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the invention that 
20 hybridizes under stringent conditions to the sequence of SEQ ID NOs:l or 3 
corresponds to a naturally occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature (e.g . , encodes a natural protein). 

As used herein, “stringent hybridization conditions” means: 42°C in a hybridization 
25 solution comprising 50% formamide, 1% SDS, 1 MNaCl, 10% (wt/vol) dextran 

sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0.1 X 
SSC and 1% SDS. 

Vectors 

30 Another aspect of the present invention is directed to vectors, or recombinant 

expression vectors, comprising any of the nucleic acid molecules described above. 
Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to 
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express DNA which encodes NgR. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded 
DNA loop into which additional DNA segments can be ligated. Another type of 
5 vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced (e.g, bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 
10 into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors are capable of directing the expression of genes to which they are 
operatively linked. Such vectors are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids. In the present specification, "plasmid" and "vector" can be used 
15 interchangeably as the plasmid is the most commonly used form of vector. However, 
the invention is intended to include such other forms of expression vectors, such as 
viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), that serve equivalent functions. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
20 vectors containing constitutive or inducible promoters directing the expression of 

either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. 
Such fusion vectors typically serve three purposes: (1) to increase expression of 
recombinant protein; (2) to increase the solubility of the recombinant protein; and (3) 
25 to aid in the purification of the recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and their cognate recognition 
30 sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67, 
31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, 
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Piscataway, N. J.) that fuse glutathione-S-transferase (GST), maltose E binding protein, 
or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amrann etal., (1988) Gene 69, 301-315) and pET lid (Studier etal. , Gene 
5 Expression Technology: Methods in Enzymology 185, Academic Press, San 

Diego, CA. (1990) 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to 
express the protein in host bacteria with an impaired capacity to proteolytically cleave 
the recombinant protein. See, Gottesman, GENE EXPRESSION TECHNOLOGY : 

10 Methods in Enzymology 185, Academic Press, San Diego, CA. (1990) 1 19-128. 

Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted 
into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20, 2111- 
2118). Such alteration of nucleic acid sequences of the invention can be carried out by 
15 standard DNA synthesis techniques. 

In another embodiment, the NgR expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSecl 
(Baldari, etal., (1987) EMBO J. 6, 229-234), pMFa (Kurjan and Herskowitz (1982) 
Cell 30, 933-943), pJRY88 (Schultz etal., (1987) Gene 54, 113-123), pYES2 
20 (Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, 

CA). 

Alternatively, NgR can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect 
cells (e.g., SF9 cells) include the pAc series (Smith et al., (1983) Mol. Cell. Biol. 3, 

25 2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170, 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC 
(Kaufman et al. (1987) EMBO J. 6, 187-195). When used in mammalian cells, the 
30 expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
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prokaryotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et al., 
(Eds.) Molecular Cloning: A Laboratory Manual. 2 nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

In another embodiment, the recombinant mammalian expression vector is 
5 capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton 
10 (1988) Adv. Immunol. 43, 235-275), in particular promoters of T cell receptors 

(Winoto and Baltimore (1989) EMBO J. 8, 729-733) and immunoglobulins (Baneiji et 
al. (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748), neuron- 
specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci. USA 86, 5473-5477), pancreas-specific promoters (Edlund et al. 

15 (1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk 

whey promoter; U.S. Pat. No. 4,873,3 16 and European Application Publication No. 
264,166), Developmentally-regulated promoters are also encompassed, e.g., the 
murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the 
oc-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546). 

20 The invention further provides a recombinant expression vector comprising a 

DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence 
in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense NgR mRNA. Regulatory sequences operatively linked 
25 to a nucleic acid cloned in the antisense orientation can be chosen that direct the 

continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen that 
direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA, 
The antisense expression vector can be in the form of a recombinant plasmid, 

30 phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation 
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of gene expression using antisense genes see Weintraub et al., Antisense RNA as a 
molecular tool for genetic analysis, Revtews—Trends in Genetics, Vol. 1(1) 1986. 

Preferred vectors include, but are not limited to, plasmids, phages, cosmids, 
episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments 
5 integratable into the host genome by homologous recombination). Preferred viral 
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, 
herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia 
viruses and retroviruses. Preferred expression vectors include, but are not limited to, 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors 
10 include, but are not limited to, pSPORT™ vectors, pGEM™ vectors (Promega), 

pPROEXvectors™ (LTI, Bethesda, MD), Bluescript™ vectors (Stratagene), pQE™ 
vectors (Qiagen), pSE420™ (Invitrogen) and pYES2™(Invitrogen). 

Preferred expression vectors are replicable DNA constructs in which a DNA 
sequence encoding NgR is operably linked or connected to suitable control sequences 
15 capable of effecting the expression of the NgR in a suitable host. DNA regions are 
operably linked or connected when they are functionally related to each other. For 
example, a promoter is operably linked or connected to a coding sequence if it controls 
the transcription of the sequence. Amplification vectors do not require expression 
control domains, but rather need only the ability to replicate in a host, usually 
20 conferred by an origin of replication, and a selection gene to facilitate recognition of 
transformants. The need for control sequences in the expression vector will vary 
depending upon the host selected and the transformation method chosen. Generally, 
control sequences include, but are not limited to a transcriptional promoter, enhancers, 
an optional operator sequence to control transcription, polyadenylation signals, a 
25 sequence encoding suitable mRNA ribosomal binding and sequences which control the 
termination of transcription and translation. Such regulatory sequences are described, 
for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN 
ENZYMOLOGY 185, Academic Press, San Diego, CA (1990). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence in many types 
30 of host cell and those that direct expression of the nucleotide sequence only in certain 
host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those 
skilled in the art that the design of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to 
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by 
nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion 
5 proteins, etc.). 

Preferred vectors preferably contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, 
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL 
promoters of bacteriophage lambda (The BACTERIOPHAGE LAMBDA, Hershey, A.D. 

10 (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1973), which is 

incorporated herein by reference in its entirety; Lambda II, Hendrix, R.W. (Ed.), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1980), which is 
incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ 
promoters of E. coli and the SV40 early promoter (Benoist et al., (1981) Nature 290, 
15 304-310, which is incorporated herein by reference in its entirety). Additional 

promoters include, but are not limited to, mouse mammary tumor virus, long terminal 
repeat of human immunodeficiency virus, maloney virus, cytomegalovirus immediate 
early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin, 
human hemoglobin, human muscle creatine and human metallothionein. 

20 Additional regulatory sequences can also be included in preferred vectors. 

Preferred examples of suitable regulatory sequences are represented by the 
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene ell 
of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by 
DNA encoding NgR and result in the expression of the mature NgR protein. 

25 Moreover, suitable expression vectors can include an appropriate marker that 

allows the screening of the transformed host cells. The transformation of the selected 
host is carried out using any one of the various techniques well known to the expert in 
the art and described in Sambrook et al., supra. 

An origin of replication can also be provided either by construction of the 
30 vector to include an exogenous origin or may be provided by the host cell 

chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter may be sufficient. Alternatively, rather than using vectors 
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which contain viral origins of replication, one skilled in the art can transform 
mammalian cells by the method of co-transformation with a selectable marker and NgR 
DNA. An example of a suitable marker is dihydrofolate reductase (DHFR) or 
thymidine kinase (see, U.S. Patent No. 4,399,216). 

5 Nucleotide sequences encoding NgR may be recombined with vector DNA in 

accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction emzyme digestion to provide appropriate termini, filling 
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining and ligation with appropriate ligases. Techniques for such manipulation are 
10 disclosed by Sambrook et al., supra and are well known in the art. Methods for 

construction of mammalian expression vectors are disclosed in, for example, Okayama 
et al . , (1983) Mol. Cell. Biol. 3:280, Cosman et al. (1986) Mol. Immunol. 23:935, 
Cosman et al., (1984) Nature 3 12:768, EP-A-0367566, and WO 91/18982, each of 
which is incorporated herein by reference in its entirety. 

15 

Host Cells and Transformed Host Cells 

According to another aspect of the invention, host cells are provided, including 
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or 
vector of the invention) in a manner that permits expression of the encoded NgR 
20 polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide. 
Polynucleotides of the invention may be introduced into the host cell as part of a 
circular plasmid, or as linear DNA comprising an isolated protein coding region or a 
viral vector. Methods for introducing DNA into the host cell that are well known and 
routinely practiced in the art include transformation, transfection, electroporation, 

25 nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and 

protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant, 
insect, invertebrate, vertebrate and mammalian cells systems. 

Host cells of the invention are a valuable source of immunogen for 
development of antibodies specifically immunoreactive with NgR. Host cells of the 
30 invention are also useful in methods for the large-scale production of NgR 

polypeptides wherein the cells are grown in a suitable culture medium and the desired 
polypeptide products are isolated from the cells, or from the medium in which the cells 
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are grown, by purification methods known in the art, e.g., conventional 
chromatographic methods including immunoaffinity chromatography, receptor affinity 
chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, 

5 high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still 
other methods of purification include those methods wherein the desired protein is 
expressed and purified as a fusion protein having a specific tag, label or chelating 
moiety that is recognized by a specific binding partner or agent. The purified protein 
can be cleaved to yield the desired protein, or can be left as an intact fusion protein. 

10 Cleavage of the fusion component may produce a form of the desired protein having 

additional amino acid residues as a result of the cleavage process. 

Knowledge of NgR DNA sequences allows for modification of cells to permit, 
or increase, expression of endogenous NgR. Cells can be modified (e.g., by 
homologous recombination) to provide increased expression by replacing, in whole or 
15 in part, the naturally occurring NgR promoter with all or part of a heterologous 

promoter so that the cells express NgR at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to endogenous NgR encoding 
sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT 
International Publication No. WO 92/20808, and PCT International Publication No. 

20 WO 91/09955.) It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene 
which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and 
dihydroorotase) and/or intron DNA may be inserted along with the heterologous 
promoter DNA. If linked to the NgR coding sequence, amplification of the marker 
25 DNA by standard selection methods results in co-amplification of the NgR coding 
sequences in the cells. 

The DNA sequence information provided by the present invention also makes 
possible the development (e.g. , by homologous recombination or "knock-out" 
strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express 
30 functional NgR or that express a variant of NgR. Such animals (especially small 

laboratory animals such as rats, rabbits and mice) are useful as models for studying the 
in vivo activities of NgR and modulators of NgR. 
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Suitable host cells for expression of the polypeptides of the invention include, 
but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression 
vector is employed, then the appropriate host cell would be any prokaryotic cell 
capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are 
5 not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, 
Streptomyces and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host cell 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but 
10 are not limited to, non-human mammalian tissue culture cells and human tissue culture 
cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells, 
Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS 
cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell 
culture has become a routine procedure (see. Tissue Culture, Academic Press, Kruse 
15 and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety). 

In addition, a yeast cell may be employed as a host cell. Preferred yeast cells 
include, but are not limited to, the genera Saccharomyces, Pichia and Kluveromyces. 
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can 
contain an origin of replication sequence from a 2T yeast plasmid, an autonomously 
20 replication sequence (ARS), a promoter region, sequences for polyadenylation, 

sequences for transcription termination and a selectable marker gene. Shuttle vectors 
for replication in both yeast and E. coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 
25 expression system (see, Luckow et al., Bio/Technology, 1988, 6, 47; Baculovirus 
Expression Vectors: A Laboratory Manual, O'Rielly et al (Eds.), W.H. 
Freeman and Company, New York, 1992; and U.S. Patent No. 4,879,236, each of 
which is incorporated herein by reference in its entirety). In addition, the MAXBAC™ 
complete baculovirus expression system (Invitrogen) can, for example, be used for 
30 production in insect cells. 

Suitable host cells are discussed further in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA 
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(1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
5 conventional transformation or transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
10 transfecting host cells can be found in Sambrook, et al. (MOLECULAR CLONING: A 

Laboratory Manual. 2nd ed.. Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 
15 may integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Various 
selectable markers include those that confer resistance to drugs, such as G418, 
hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid 
20 encoding a selectable marker can be introduced into a host cell on the same vector as 
that encoding NgR or can be introduced on a separate vector. Cells stably transfected 
with the introduced nucleic acid can be identified by drug selection (e.g., cells that 
have incorporated the selectable marker gene will survive, while the other cells die). 

In a preferred embodiment, the polypeptides of the invention, including forms 
25 of NgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, NgR/Ig 

fusions and fragments and variations of each of the above are expressed in Chinese 
Hamster Ovary (CHO) cells. 

In order to introduce the DNA fragment coding for the NgR protein or 
polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide, 
30 it is necessary to construct the expression vector. 

The vectors for CHO expression include, but are not limited to, pAl-11, pXTl, 
pRc/CMV, pRc/RSV and pcDNAINeo. The promoter is not specifically limited 
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provided it effectively promotes expression in CHO cells. Examples of suitable 
promoters are: SRcc, SV40, LTR, CMV, and HSV-TK. Of these, CMV and Sra 
promoters are preferred. 

In addition to the above-mentioned promoters, the expression vectors may 
5 contain enhancers, splicing signals, polyadenylation signals, selectable markers and an 
SV40 replication origin. Suitable selectable markers include, but are not limited to the 
dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate 
(MTX), the ampicillin resistance gene, and the neomycin resistance gene. 

Examples of the expression vectors each containing the DNA coding for NgR, 
10 portions, fragments and soluble constructs thereof, include the vector (such as one 

described above), into which the promoter is operably linked (preferably upstream) to 
the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal 
downstream from the nucleotide sequence encoding the NgR construct; and, 
preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin 
15 resistant gene is also operably contained in the vector. 

CHO cell lacking the DHFR gene (Urlaub, G. et ah, (1980) Proc. Natl. Acad. 
Sci. USA 77, 4216-4220) and CHO-K1 (Proc. Natl. Acad. Set. USA 60, 1275 (1968)) 
are suitable for use. 

The NgR expression vectors prepared as above are introduced into CHO cells 
20 by any known method, including, but not limited to the calcium phosphate method 

(Graham and van der Eb (1973) Virol. 52, 456-467) and electroporation (Nuemann et 
al., (1982) EMBO J. 1, 841-845). 

Transformants carrying the expression vectors are selected based on the 
above-mentioned selectable markers. Repeated clonal selection of the transformants 
25 using the selectable markers allows selection of stable cell lines having high expression 
of the NgR constructs. Increased MTX concentrations in the selection medium allows 
gene amplification and greater expression of the desired protein. The CHO cell 
containing the recombinant NgR can be produced by cultivating the CHO cells 
containing the NR expression vectors constitutively expressing the NgR constructs. 

30 Media used in cultivating CHO cells includes DMEM medium supplemented 

with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The 
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pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to 
40°C for about 15 to 72 hours with aeration. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i. e . , express) NgR protein. Accordingly, the 
5 invention further provides methods for producing NgR protein using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding NgR has been 
introduced) in a suitable medium such that NgR protein is produced. In another 
embodiment, the method further comprises isolating NgR from the medium or the host 
10 cell. 

In situations where the NgR polypeptide will be found primarily intracellularly, 
intracellular material (including inclusion bodies for Gram-negative bacteria) can be 
extracted from the host cell using any standard technique known to one of ordinary 
skill in the art. Such methods would encompass, by way of example and not by way of 
15 limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by 
French press, homogenization and/or sonication followed by centrifugation. 

If the NgR polypeptide has formed inclusion bodies in the cytosol, such 
inclusion bodies may frequently bind to the inner and/or outer cellular membranes. 
Upon centrifugation, the inclusion bodies will be found primarily in the pellet material. 
20 The pellet material can then be treated at pH extremes or with one or more chaotropic 
agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives 
in the presence of a reducing agent such as dithiothreitol at alkaline pH or 
tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the 
inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel 
25 electrophoresis, immunoprecipitation or the like. Various methods of isolating the 
NgR polypeptide would be apparent to one of ordinary skill in the art, for example, 
isolation may be accomplished using standard methods such as those set forth below 
and in Marston et al (1990) Meth. Enzymol. 182, 264-275 (incorporated by reference 
herein in its entirety). 

30 If isolated NgR polypeptide is not biologically active following the isolation 

procedure employed, various methods for "refolding" or converting the polypeptide to 
its tertiary structure and generating disulfide linkages, can be used to restore biological 
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activity. Methods known to one of ordinary skill in the art include adjusting the pH of 
the solubilized polypeptide to a pH usually above 7 and in the presence of a particular 
concentration of a chaotrope. The selection of chaotrope is very similar to the choices 
used for inclusion body solubilization but usually at a lower concentration and is not 
5 necessarily the same chaotrope as used for the solubilization. It may be required to 

employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio, 
to generate a particular redox potential allowing for disulfide shuffling to occur in the 
formation of the protein's cysteine bridge(s). Some of the commonly used redox 
couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, 
10 dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To 

increase the efficiency of the refolding, it may be necessary to employ a co solvent, such 
as glycerol, polyethylene glycol of various molecular weights and arginine. 

Transgenic Animals 

1 5 The host cells of the invention can also be used to produce non-human 

transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have 
been introduced. Such host cells can then be used to create non-human transgenic 
animals in which exogenous NgR sequences have been introduced into their genome or 
20 homologous recombinant animals in which endogenous NgR sequences have been 

altered. Such animals are useful for studying the function and/or activity of NgR and 
for identifying and/or evaluating modulators of NgR activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes 
25 a transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is 
integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
30 used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NgR gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, 
prior to development of the animal. 

A transgenic animal of the invention can be created by introducing NgR- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
5 microinjection, retroviral infection, and allowing the oocyte to develop in a 

pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID 
NOs:l or 3 can be introduced as a transgene into the genome of a non-human animal. 
Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR 
gene, can be isolated based on hybridization to the human NgR cDNA (described 
10 further above) and used as a transgene. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR 
transgene to direct expression of NgR protein to particular cells. Methods for 
generating transgenic animals via embryo manipulation and microinjection, particularly 
1 5 animals such as mice, have become conventional in the art and are described, for 

example, inU.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan 1986, in 
Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of the 
20 NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
NgR can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which 
25 contains at least a portion of a NgR gene into which a deletion, addition or substitution 

has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The 
NgR gene can be a human gene (e.g, SEQ ID NOs: 1 or 13), but more preferably, is a 
non-human homolog of a human NgR gene. For example, a mouse homolog of human 
NgR gene of SEQ ID NOs: 1 or 13 can be used to construct a homologous 
30 recombination vector suitable for altering an endogenous NgR gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
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recombination, the endogenous NgR gene is functionally disrupted (z. e . , no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous NgR gene is mutated or otherwise altered but still 
5 encodes functional protein ( e.g :, the upstream regulatory region can be altered to 

thereby alter the expression of the endogenous NgR protein). In the homologous 
recombination vector, the altered portion of the NgR gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the NgR gene to allow for homologous 
recombination to occur between the exogenous NgR gene carried by the vector and an 
10 endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic 
acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' 
ends) are included in the vector. See e.g. , Thomas et al. (1987) Cell 5 1:503 for a 
description of homologous recombination vectors. The vector is introduced into an 
15 embryonic stem cell line (e.g., by electroporation) and cells in which the introduced 

NgR gene has homologously recombined with the endogenous NgR gene are selected 
(see e.g., Li et al. (1992) Cell 69:915). 

The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras. See e.g., Bradley 1987, In: 

20 TERATOCARCINOMAS and EMBRYONIC Stem CELLS: A Practical Approach, 

Robertson, ed. IRL, Oxford, pp. 113-152. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term. 
Progeny harboring the homologously recombined DNA in their germ cells can be used 
to breed animals in which all cells of the animal contain the homologously recombined 
25 DNA by germline transmission of the transgene. Methods for constructing 

homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Curr. Opin. Biotechnol. 2:823-829; PCT 
International Publication Nos.: WO 90/1 1354; WO 91/01140; WO 92/0968; and WO 
93/04169. 

30 In another embodiment, transgenic non-humans animals can be produced that 

contain selected systems that allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
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a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. 
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) 

Science 251:1351-1355. If a cre/loxP recombinase system is used to regulate 
5 expression of the transgene, animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such animals can be provided 
through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

10 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al. (1997) Nature 385:810- 
813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 
15 of the same species from which the quiescent cell is isolated. The reconstructed 

oocyte is then cultured such that it develops to morula or blastocyte and then 
transferred to pseudopregnant female foster animal. The offspring borne of this female 
foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 
isolated. 

20 

Antisense 

Also provided by the invention are antisense polynucleotides that recognize and 
hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides 
are provided. Fragment antisense molecules of the invention include (i) those that 
25 specifically recognize and hybridize to NgR RNA (as determined by sequence 
comparison of DNA encoding NgR to DNA encoding other known molecules). 
Identification of sequences unique to NgR encoding polynucleotides can be deduced 
through use of any publicly available sequence database, and/or through use of 
commercially available sequence comparison programs. After identification of the 
30 desired sequences, isolation through restriction digestion or amplification using any of 
the various polymerase chain reaction techniques well known in the art can be 
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performed. Antisense polynucleotides are particularly relevant to regulating 
expression of NgR by those cells expressing NgR mRNA. 

Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in 
SEQ ID NO:l, 3, 13 or sequences complementary or homologous thereto, derived 
5 from the nucleotide sequences of the present invention encoding NgR are useful as 
diagnostic tools for probing gene expression in various tissues. For example, tissue 
can be probed in situ with oligonucleotide probes carrying detectable groups by 
conventional autoradiography techniques to investigate native expression of this 
enzyme or pathological conditions relating thereto. In specific aspects, antisense 
10 nucleic acid molecules are provided that comprise a sequence complementary to at 

least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or 
to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, 
derivatives and analogs of a NgR protein of SEQ ID NO:2, 4 or 14 or antisense 
nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs: 1, 3 or 
15 13 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding NgR. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues (e.g., the protein coding region of human 
20 NgR corresponds to the coding region SEQ ID NO:l, 3 or 13). In another 

embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence encoding NgR. The term "noncoding 
region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5* and 3' untranslated regions). 

25 Antisense oligonucleotides are preferably directed to regulatory regions of a 

nucleotide sequence of SEQ ID NO:l, 3, 13 or mRNA corresponding thereto, 
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and 
the like. Given the coding strand sequences encoding NgR disclosed herein (e.g., SEQ 
ID NO:l, 3 or 13), antisense nucleic acids of the invention can be designed according 
30 to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of NgR mRNA, but 
more preferably is an oligonucleotide that is antisense to only a portion of the coding 
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or noncoding region of NgR mRNA. For example, the antisense oligonucleotide can 
be complementary to the region surrounding the translation start site of NgR mRNA. 
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
5 constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
1 5 carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 

dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
20 mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 

N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5 -oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 
25 diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i. e . , RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

30 The antisense nucleic acid molecules of the invention (preferably 

oligonucleotides of 1 0 to 20 nucleotides in length) are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mKNA 
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and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR 
expression at either the transcriptional or translational level is useful to generate 
cellular or animal models for diseases/conditions characterized by aberrant NgR 
5 expression. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule 
that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. 

Phosphorothioate and methylphosphonate antisense oligonucleotides are 
10 specifically contemplated for therapeutic use by the invention. The antisense 
oligonucleotides may be further modified by adding poly-L-lysine, transferrin 
polylysine or cholesterol moieties at their 5' end. 

An example of a route of administration of antisense nucleic acid molecules of 
the invention includes direct injection at a tissue site. Alternatively, antisense nucleic 
15 acid molecules can be modified to target selected cells and then administered 

systemically. For example, for systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or antigens expressed on a 
selected cell surface, e.g . , by linking the antisense nucleic acid molecules to peptides or 
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid 
20 molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

25 is an a-anomeric nucleic acid molecule. An cc-anomeric nucleic acid molecule forms 

\ 

specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual P-units, the strands run parallel to each other (Gaultier et al., (1987) Nucleic 
Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue et al., (1987) Nucleic Acids Res. 15, 6131-6148) or a 
30 chimeric RNA-DNA analogue (Inoue et al., (1987) FEB S Lett. 215, 327-330). 

The NgR sequences taught in the present invention facilitate the design of 
novel transcription factors for modulating NgR expression in native cells and animals. 
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and cells transformed or transfected with NgR polynucleotides. For example, the 
Cys 2 -His 2 zinc finger proteins, which bind DNA via their zinc finger domains, have 
been shown to be amenable to structural changes that lead to the recognition of 
different target sequences. These artificial zinc finger proteins recognize specific target 
5 sites with high affinity and low dissociation constants, and are able to act as gene 
switches to modulate gene expression. Knowledge of the particular NgR target 
sequence of the present invention facilitates the engineering of zinc finger proteins 
specific for the target sequence using known methods such as a combination of 
structure-based modeling and screening of phage display libraries (Segal et al., (1999) 
10 Proc. Natl. Acad. Sci. USA 96, 2758-2763; Liu et al., (1997) Proc. Natl. Acad. Sci. 

USA 94, 5525-5530; Greisman et al. (1997) Science 275, 657-661; Choo et al., (1997) 
J. Mol. Biol. 273, 525-532). Each zinc finger domain usually recognizes three or more 
base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in 
length to render it unique in any known genome, a zinc finger protein consisting of 6 
1 5 tandem repeats of zinc fingers would be expected to ensure specificity for a particular 

sequence (Segal et al., (1999), above). The artificial zinc finger repeats, designed 
based on the promoter of NgR sequences, are fused to activation or repression 
domains to promote or suppress NgR expression (Liu et al., (1997), above). The 
promoter of NgR may be obtained by standard methods known to one of ordinary skill 
20 in the art with the disclosure contained herein and knowledge of the NgR sequence. 
Alternatively, the zinc finger domains can be fused to the TATA box-binding factor 
(TBP) with varying lengths of linker region between the zinc finger peptide and the 
TBP to create either transcriptional activators or repressors (Kim et al., (1997) Proc. 
Natl. Acad. Sci. USA 94, 3616-3620. Such proteins and polynucleotides that encode 
25 them, have utility for modulating NgR expression in vivo in both native cells, animals 
and humans; and/or cells transfected with NgR-encoding sequences. The novel 
transcription factor can be delivered to the target cells by transfecting constructs that 
express the transcription factor (gene therapy), or by introducing the protein. 
Engineered zinc finger proteins can also be designed to bind RNA sequences for use in 
30 therapeutics as alternatives to antisense or catalytic RNA methods (McColl et al., 
(1997) Proc. Natl. Acad. Sci. USA 96, 9521-9526); Wu etal., (1995) Proc. Natl. 

Acad. Sci. USA 92, 344-348). The present invention contemplates methods of 
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designing such transcription factors based on the gene sequence of the invention, as 
well as customized zinc finger proteins, that are useful to modulate NgR expression in 
cells (native or transformed) whose genetic complement includes these sequences. 

5 Ribozymes and PNA moieties 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes {e.g., hammerhead ribozymes, 

10 described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to 

catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR 
mRNA. A ribozyme having specificity for a NgR-encoding nucleic acid can be 
designed based upon the nucleotide sequence of a NgR DNA disclosed herein (i.e., 
SEQ ID NOs:l, 3 or 13). For example, a derivative of a Tetrahymena L-19 IVS RNA 
15 can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA. 
See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No. 

5, 1 16,742. Alternatively, NgR mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al., 

20 (1993) Science 261, 141 1-1418. 

Alternatively, NgR gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NgR {e.g., the NgR 
promoter and/or enhancers) to form triple helical structures that prevent transcription 
of the NgR gene in target cells. See generally, Helene (1991) Anti cancer Drug Des. 6: 
25 569-584; Helene, et al., (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher (1992) 

BioEssays 14, 807-815. 

In various embodiments, the nucleic acids of NgR can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 
30 backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 

Hyrup et al., (1996) Bioorg. Med. Chem. Lett. 4, 5-23). As used herein, the terms 
"peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in 
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which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone 
and only the four natural nucleobases are retained. The neutral backbone of PNAs has 
been shown to allow for specific hybridization to DNA and RNA under conditions of 
low ionic strength. The synthesis of PNA oligomers can be performed using standard 
5 solid phase peptide synthesis protocols as described in Hyrup et al., (1996) above; 
Perry-O'Keefe et al., (1996) Proc. Natl. Acad. Sci. USA 93,14670-14675. 

PNAs of NgR can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 
10 inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single 
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., SI nucleases 
(Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization 
(Hyrup et al., (1996), above; Perry-O'Keefe (1996), above). 

15 In another embodiment, PNAs of NgR can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques 
of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be 
generated that may combine the advantageous properties of PNA and DNA. Such 
20 chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to 
interact with the DNA portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths selected in terms of base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be 
25 performed as described in Hyrup (1996), above and Finn et al. (1996) Nucleic Acids 
Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, 
e.g., 5'-(4-methoxytrityl) amino- 5 '-deoxy-thymidine phosphoramidite, can be used 
between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 17, 

30 973-988). PNA monomers are then coupled in a stepwise manner to produce a 

chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996), 
above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment 
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and a 3' PNA segment. See, Petersen et al. (1975) Bioorg. Med. Chem. Lett. 5:11 19- 
1124. 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see Letsinger et al., (1989) Proc. Natl. Acad. Sci. 
USA 86, 6553-6556; Lemaitre etal., (1987) Proc. Natl. Acad. Sci. USA 84, 648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization triggered cleavage agents (see, e.g., Krol et al., (1988) Biotechniques 6, 
10 958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5, 539-549). To 

this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a 
hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

Automated sequencing methods can be used to obtain or verify the nucleotide 
15 sequence of NgR. The NgR nucleotide sequences of the present invention are believed 
to be 100% accurate. However, as is known in the art, nucleotide sequence obtained 
by automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at 
least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid 
20 molecule. The actual sequence may be more precisely determined using manual 

sequencing methods, which are well known in the art. An error in a sequence which 
results in an insertion or deletion of one or more nucleotides may result in a frame shift 
in translation such that the predicted amino acid sequence will differ from that which 
would be predicted from the actual nucleotide sequence of the nucleic acid molecule, 
25 starting at the point of the mutation. 

Polypeptides 

The invention also provides purified and isolated mammalian NgR polypeptides 
encoded by a polynucleotide of the invention. Presently preferred is a human NgR 
30 polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID 
NO: 14. Another preferred embodiment is a mouse NgR polypeptide comprising the 
amino acid sequence of NgR3, as set forth in SEQ ID NO:4. 
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One aspect of the invention pertains to isolated NgR proteins, and biologically 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 
provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR 
antibodies. Preferably, fragments of NgR proteins comprise at least one biological 
5 activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or 

tissue sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NgR proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, a NgR protein or polypeptide can 
be synthesized chemically using standard peptide synthesis techniques. 

10 The invention also embraces polypeptides that have at least 99%, at least 95%, 

at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the 
preferred polypeptide of the invention. In addition, the invention embraces 
polypeptides having the consensus sequence shown in SEQ ID NO: 6, shown in Table 
15 5) excluding the previously characterized NgR ("NgRl"), and polypeptides comprising 

at least about 90% of the consensus sequence. 

The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g.. A, T, C, G, U, or I, in the case 
20 of nucleic acids) occurs in both sequences to yield the number of matched positions, 

dividing the number of matched positions by the total number of positions in the region 
of comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes 
a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 
25 sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

In one aspect, percent homology is calculated as the percentage of amino acid 
residues in the smaller of two sequences which align with identical amino acid residue 
30 in the sequence being compared, when four gaps in a length of 100 amino acids may be 
introduced to maximize alignment (Dayhoff, in ATLAS OF PROTEIN SEQUENCE AND 
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Structure, Vol. 5, p. 124, National Biochemical Research Foundation, Washington, 
D.C. (1972), incorporated herein by reference). 

A determination of homology or identity is typically made by a computer 
homology program known in the art. An exemplary program is the Gap program 
5 (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer 

Group, University Research Park, Madison, WI) using the default settings, which uses 
the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489, which in 
incorporated herein by reference in its entirety). Employing the GAP software 
provided in the GCG program package, (see Needleman and Wunsch (1970) J. Mol. 

10 Biol. 48, 443-453) the following settings for nucleic acid sequence comparison may be 
used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with 
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:l, 3 or 13. 

15 BestFit was originally written for Version 1.0 by Paul Haeberli from a careful reading 
of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman 
(1981), above. The following Bestfit settings for nucleic acid sequence comparison 
may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 
20 identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the 
CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14. 

Alternatively, homology may be determined by hybridization analysis wherein a 
nucleic acid sequence is hybridized to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent 
25 conditions. See e.g. Ausubel, et al., (Eds.) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, NY, 1993, and below. 

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. 

30 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the NgR protein is derived, or substantially free from 
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chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of NgR protein in which 
the protein is separated from cellular components of the cells from which it is isolated 
or recombinantly produced. In one embodiment, the language "substantially free of 
5 cellular material" includes preparations of NgR protein having less than about 30% (by 
dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-NgR protein, still more preferably less 
than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR 
protein. When the NgR protein or biologically active portion thereof is recombinantly 
10 produced, it is also preferably substantially free of culture medium, i.e., culture 

medium represents less than about 20%, more preferably less than about 10%, and 
most preferably less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NgR protein in which the protein is separated from chemical 
15 precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other 
chemicals" includes preparations of NgR protein having less than about 30% (by dry 
weight) of chemical precursors or non-NgR chemicals, more preferably less than about 
20% chemical precursors or non-NgR chemicals, still more preferably less than about 
20 10% chemical precursors or non-NgR chemicals, and most preferably less than about 

5% chemical precursors or non-NgR chemicals. 

Biologically active portions of a NgR protein include peptides comprising 
amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4 
25 or 14 that include fewer amino acids than the full length NgR proteins, and exhibit at 
least one activity of a NgR protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the NgR protein. A biologically active 
portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acids in length. 

30 A biologically active portion of a NgR protein of the present invention may 

contain at least one of the features that is conserved between the NgR proteins (e.g., a 
conserved cysteine as the N-terminus of the mature protein, four conserved cysteines 
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in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal 
with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic 
C-terminus). An alternative biologically active portion of a NgR protein may contain 
at least two of the above-identified domains. Another biologically active portion of a 
5 NgR protein may contain at least three of the above-identified domains. Yet another 
biologically active portion of a NgR protein of the present invention may contain at 
least four of the above-identified domains. 

Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one 
10 or more of the functional activities of a native NgR protein. 

In an embodiment, the NgR protein has an amino acid sequence shown in SEQ 
ID NO: 2, 4 or 14. In other embodiments, the NgR protein is substantially homologous 
to SEQ ID NO: 2, 4 or 14 and retains the functional activity of the protein of SEQ ID 
NO:2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or 
15 mutagenesis, as described in detail below. 

Accordingly, in another embodiment, the NgR protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO: 14 and retains the 
functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14. 

20 Use of mammalian host cells is expected to provide for such post-translational 

modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may 
be needed to confer optimal biological activity on recombinant expression products of 
the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are 
embraced by the invention. 

25 The invention also embraces variant (or analog) NgR polypeptides. In one 

example, insertion variants are provided wherein one or more amino acid residues 
supplement a NgR amino acid sequence. Insertions may be located at either or both 
termini of the protein, or may be positioned within internal regions of the NgR amino 
acid sequence. Insertional variants with additional residues at either or both termini 
30 can include, for example, fusion proteins and proteins including amino acid tags or 
labels. 



WO 02/29059 


PCT/US01/31488 


-46 - 

Insertion variants include NgR polypeptides wherein one or more amino acid 
residues are added to a NgR acid sequence or to a biologically active fragment thereof. 

Variant products of the invention also include mature NgR products, i.e., NgR 
products wherein leader or signal sequences are removed, with additional amino 
5 terminal residues. The additional amino terminal residues may be derived from another 
protein, or may include one or more residues that are not identifiable as being derived 
from specific proteins. NgR products with an additional methionine residue at 
position -1 (Met" 1 -NgR) are contemplated, as are variants with additional methionine 
and lysine residues at positions -2 and -1 (Met' 2 -Ly s' 1 -N gR) . Variants of NgR with 
10 additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are 
particularly useful for enhanced. recombinant protein production in bacterial host cells. 

Polypeptide Variants 

The invention also embraces NgR variants having additional amino acid 
15 residues which result from use of specific expression systems. 

As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR 
polypeptide operatively linked to a non-NgR polypeptide. A "NgR polypeptide" refers 
to a polypeptide having an amino acid sequence corresponding to NgR, whereas a 
"non-NgR polypeptide" refers to a polypeptide having an amino acid sequence 
20 corresponding to a protein that is not homologous to the NgR protein, e.g., a protein 
that is different from the NgR protein and that is derived from the same or a different 
organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a 
portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least 
one biologically active portion of a NgR protein. In another embodiment, a NgR 
25 fusion protein comprises at least two biologically active portions of a NgR protein. In 
yet another embodiment, a NgR fusion protein comprises at least three biologically 
active portions of a NgR protein. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide 
are fused in-frame to each other. The non-NgR polypeptide can be fused to the N- 
30 terminus or C-terminus of the NgR polypeptide. 

For example, in one embodiment a NgR fusion protein comprises a NgR 
domain operably linked to the extracellular domain of a second protein. Such fusion 
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proteins can be further utilized in screening assays for compounds which modulate 
NgR activity (such assays are described in detail below). 

For example, use of commercially available vectors that express a desired 
polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the 
5 desired polypeptide having an additional glycine residue at position -1 after cleavage of 
the GST component from the desired polypeptide. 

In another embodiment, the fusion protein is a NgR protein containing a 
heterologous signal sequence at its N-terminus. For example, the native NgR signal 
sequence ( i.e ., amino acids 1-30 of SEQ ID NO: 2 and amino acids 1-40 of SEQ ID 
10 NO:4) can be removed and replaced with a signal sequence from another protein. In 

certain host cells (e.g . , mammalian host cells), expression and/or secretion NgR can be 
increased through use of a heterologous signal sequence. 

In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion 
protein in which the NgR sequences comprising one or more domains are fused to 
15 sequences derived from a member of the immunoglobulin protein family. The NgR- 
immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress 
NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be 
20 used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR 
ligand/NgR interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NgR antibodies in a subject, to 
25 purify NgR ligands, and in screening assays to identify molecules that inhibit the 
interaction of NgR with NgR ligand. 

A NgR chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
30 techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 

restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and 
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enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise 
to complementary overhangs between two consecutive gene fragments that can 
5 subsequently be annealed and reamplified to generate a chimeric gene sequence (see, 
for example, Ausubel et al. (Eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 
John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g . , a GST polypeptide). ANgR- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
10 moiety is linked in-frame to the NgR protein. 

Variants resulting from expression in other vector systems are also 
contemplated. 

Insertional variants also include fusion proteins wherein the amino terminus 
and/or the carboxy terminus of NgR is/are fused to another polypeptide. 

15 In another aspect, the invention provides deletion variants wherein one or more 

amino acid residues in a NgR polypeptide are removed. Deletions can be effected at 
one or both ter m i ni of the NgR polypeptide, or with removal of one or more 
non-terminal amino acid residues of NgR. Deletion variants, therefore, include all 
fragments of a NgR polypeptide. 

20 The invention also embraces polypeptide fragments of the sequence set forth in 

SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g . , ligand binding 
and/or intracellular signaling) immunological properties of a NgR polypeptide. 
Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino 
acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred 
25 polypeptide fragments display antigenic properties unique to, or specific for, human 
NgR and its allelic and species homologs. Fragments of the invention having the 
desired biological and immunological properties can be prepared by any of the methods 
well known and routinely practiced in the art. 

In still another aspect, the invention provides substitution variants of NgR 
30 polypeptides. Substitution variants include those polypeptides wherein one or more 
amino acid residues of a NgR polypeptide are removed and replaced with alternative 
residues. In one aspect, the substitutions are conservative in nature; however, the 
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10 


15 


20 


25 


invention embraces substitutions that are also non-conservative. Conservative 
substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below. 


Table 1. 


x aa # 

(based on a NTLRRCT 
domain) 

Column I 
(Rl, R2, R3) 

Column n 
(R2+R3 only) 

X x 

G, R, M 


x 2 

A, D, C 


X 3 

V, T 


X 4 

N, P, S 


X 5 

E, A, S 


X 6 

nothing, K 

nothing 

X 7 

V, M, P 


X 8 

T, V 

V 

x 9 

Q,P 

Q 

Xio 

Q, A 

Q 

X u 

Q, H, N 


X 12 

G, N 

N 

X 13 

L,F 

F 

Xi4 

Q,A,S 


X 15 

A, S 


X 16 

V.I 


Xi7 

V, T, E, L 


Xj8 

S, G 


X i9 

L,I 


X 20 

A, E, V, P 


X 21 

A, S, D 


X 22 

S, T 


X 2 3 

Q,E 
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X aa # 

(based on a NTLRRCT 
domain) 

Column I 
(Rl, R2, R3) 

Column II 
(R2+R3 only) 

x 2+ 

IVL 


,x 25 

Q,H 

Q 

X 26 

N,G 

N 

X 27 

R,L 


X 2g 

T,G,R,S 


X 29 

F,L,T,H 


X 30 

l,v 

L 

X 31 

Q,R,P 


X 32 

Q,P,A 

P 

X 33 

G,A 

G 

X 34 

H,T,S 


X 35 

S,G,R 


X 36 

P,S,A 


X 37 

C, nothing 

nothing 

X 38 

R, nothing 

nothing 

X 39 

A,N 


X 40 

M,L 


X 4I 

V,L,T 


X 42 

T, I 

T 

X 43 

L,I 


X 44 

y,f,h 


X 45 

N,V 

N 

X4 6 

I,L 


X 47 

T 3 S,A 


X 48 

f,y,t,r 


X 49 

a,h,y,d 



25 
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x aa # 

(based on a NTLRRCT 
domain) 

Column I 
(Rl, R2, R3) 

Column EE 
(R2+R3 only) 

^128 

P,D 


Xj29 

V,G,P,R 


^130 

A,S 


X 131 

E,Q 

Q 

^132 

F,Y 

F 

X 133 

gad 


^134 

A,P 


x 13S 

DAV 


X 136 

G,D 


^137 

A,E 


x 138 

S,P 


^139 

EA 


^140 

L,F 


x 141 

R,Q 


X 142 

R,K 

R 

X 143 

R,K 

R 

Xu. 

FA 


X 145 

G,V 


X 146 

a,d,e 


X 147 

T,P 


X 14 8 

A,V,S 


X 149 

t,s,l 


X 15 o 

E,G,P,Q 


X 151 

L,E,R 


X 152 

R,L 

R 

X 153 

G, D 



25 
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Variant polypeptides include those wherein conservative substitutions have 
been introduced by modification of polynucleotides encoding polypeptides of the 
invention. Amino acids can be classified according to physical properties and 
1 5 contribution to secondary and tertiary protein structure. A conservative substitution is 

recognized in the art as a substitution of one amino acid for another amino acid that 
has similar properties. Exemplary conservative substitutions are set out in Table 2 
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed 
9/6/96), immediately below. 

20 

Table 2 

Conservative Substitutions I 
SIDE CHAIN 

CHARACTERISTIC AMINO ACID 

Aliphatic 

Non-polar GAP 

I L V 


- 55 - 


x aa # 

(based on a NTLRRCT 
domain) 

Column I 
(Rl, R2, R3) 

Column H 
(R2+R3 only) 

^154 

Q,H,A 


x 155 

Q,R 


x 156 

K,R 


^157 

l,a,r 


X 158 

R,A 

R 

X 159 

V,A,E 


X 160 

E,A,N 


X«1 

F,L 

F 

X 162 

R,Q 


X 163 

N,A,G 
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Polar - uncharged 

C STM 


NQ 

Polar - charged 

DE 


KR 

Aromatic 

HFWY 

Other 

NQDE 


5 Alternatively, conservative amino acids can be grouped as described in 

Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, NY 
(1975), pp. 71-77] as set out in Table 3, immediately below. 

Table 3 

Conservative Substitutions II 

0 

SIDE CHAIN 


CHARACTERISTIC 

AMINO ACID 

Non-polar (hydrophobic) 

A. Aliphatic: 

ALI VP 

B. Aromatic: 

F W 

C. Sulfur-containing: 

M 

D. Boderline: 

G 

Uncharged-polar 

A. Hydroxyl: 

STY 

B. Amides: 

NQ 

C. Sylfhydryl: 

C 

D. Boderline: 

G 

Positively Charged (Basic): 

KRH 

Negatively Charged (Acidic): 

DE 


As still another alternative, exemplary conservative substitutions are set out in 
25 Table 4, below. 


Table 4 

Conservative Substitutions HI 


Original 

Residue 

Exemplary Substitution 

Ala (A) 

Val, Leu, lie 

Arg (R) 

Lys, Gin, Asn 

Asn (N) 

Gin, His, Lys, Arg 

Asp (D) 

Glu 

Cys (C) 

Ser 

Gln(Q) 

Asn 

Glu (E) 

Asp 


35 
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His (H) 

Asn, Gin, Lys, Arg 

lie (I) 

Leu, Val, Met, Ala, Phe, 

Leu (L) 

lie, Val, Met, Ala, Phe 

Lys (K) 

Arg, Gin, Asn 

Met (M) 

Leu, Phe, He 

Phe (F) 

Leu, Val, He, Ala 

Pro (P) 

Gly 

Ser (S) 

Thr 

Thr (T) 

Ser 

Trp (W) 

Tyr 

Tyr(Y) 

Trp, Phe, Thr, Ser 

Val (V) 

He, Leu, Met, Phe, Ala 


In addition, amino acid residues that are conserved among family members of 
1 5 the NgR proteins of the present invention, as indicated by the alignment presented 

herein, are also predicted to be particularly unamenable to alteration. For example, 
NgR proteins of the present invention can contain at least one domain that is a typically 
conserved region inNgRs. Examples of these conserved domains include, e.g., 
leucine-rich repeat domain. Amino acid residues that are not conserved or are only 
20 semi-conserved among members of the NgR proteins may be readily amenable to 

alteration. 

Full-length NgRs have an LRR region characterized by the amino acid 
consensus sequence shown in SEQ ID NO: 19. At least some full-length NgRs also 
include a CT signaling (CTS) domain and a GPI domain. 

25 The NgR domain designations used herein are defined as follows: 


Domain 

hNgRl 
SEQ ID: 5 

mNgRl 
SEQ ED 
NO: 17 

hNgR2 
SEQ ID: 2 

liNgR3 
SEQ ID: 14 

mNgR3 
SEQ ID: 4 

Signal Seq. 

1-26 

1-26 

1-30 

— 

1-40 

LRRNT 

27-56 

27-56 

31-59 

— 

41-69 

LRR1 

57-81 

57-81 

60-82 

5-27 

70-92 ' 

LRR2 

82-105 

82-105 

83-106 

28-51 

93-106 

LRR3 

106-130 

106-130 

107-131 

52-76 

106-141 

LRR4 

131-154 

131-154 

132-155 

77-100 

142-165 
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LRR5 

155-178 

155-178 

156-179 

101-124 

166-189 

LRR6 

179-202 

179-202 

180-203 

125-148 

190-213 

LRR7 

203-226 

203-226 

204-227 

149-172 

214-237 

LRR8 

227-250 

227-250 

228-251 

173-196 

238-261 

LRRCT 

260-309 

260-309 

261-310 

206-255 

271-320 

CTS 

(CT 

Signaling) 

310-445 

310-445 

311-395 

256-396 

321-438 

GPI 

446-473 

456-473 

396-420 

370-392 

439-462 


10 

In some embodiments of the invention, the above domains are modified. 
Modification can be in a manner that preserves domain functionality. Modification can 
include addition, deletion or substitution of certain amino acids. Exemplary 
modifications include conservative amino acid substitutions. Preferably such 
15 substitutions number 20 or fewer per 100 residues. More preferably, such 

substitutions number 10 or fewer per 100 residues. Further exemplary modifications 
include addition of flanking sequences of up to five amino acids at the N terminus 
and/or C terminus of one or more of the domains. 

In some embodiments, the isolated nucleic acid molecule encodes a polypeptide 
20 at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99% 
homologous to SEQ ID NO :2, 4 or 14. 

Mutations can be introduced into SEQ ID NOS:l, 3 or 13 by standard 
techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis. 
Conservative amino acid substitutions can be made at one or more amino acid residues 
25 predicted to be non-essential. Alternatively, mutations can be introduced randomly 
along a NgR coding sequence. This can be accomplished, e g., by saturation 
mutagenesis. The resulting mutants can be screened for NgR biological activity. 
Biological activities of NgR may include but are not limited to: (1) protein:protein 
interactions, e.g., with other NgRs or other cell-surface proteins involved in Nogo- 
30 related signaling; (2) complex formation with a NgR ligand; (3) binding to an anti- 
NgR antibody. 
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It should be understood that the definition of polypeptides of the invention is 
intended to include polypeptides bearing modifications other than insertion, deletion, 
or substitution of amino acid residues. By way of example, the modifications may be 
covalent in nature, and include for example, chemical bonding with polymers, lipids, 

5 other organic and inorganic moieties. Such derivatives may be prepared to increase 
circulating half-life of a polypeptide, or may be designed to improve the targeting 
capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention 
further embraces NgR polypeptides that have been covalently modified to include one 
or more water-soluble polymer attachments such as polyethylene glycol, 

10 polyoxyethylene glycol or polypropylene glycol. Variants that display ligand binding 
properties of native NgR and are expressed at higher levels, as well as variants that 
provide for constitutively active receptors, are particularly useful in assays of the 
invention; the variants are also useful in providing cellular, tissue and animal models of 
diseases/conditions characterized by aberrant NgR activity. 

1 5 Chemically modified NgR polypeptide compositions in which the NgR 

polypeptide is linked to a polymer are included within the scope of the present 
invention. The polymer may be water soluble to prevent precipitation of the protein in 
an aqueous environment, such as a physiological environment. Suitable water-soluble 
polymers may be selected from the group consisting of, for example, polyethylene 
20 glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other 
carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, 
polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer 
polyoxyethylated polyols (e.g. glycerol) and polyvinyl alcohol. The selected polymer is 
usually modified to have a single reactive group, such as an active ester for acylation or 
25 an aldehyde for alkylation, so that the degree of polymerization may be controlled. 

Polymers may be of any molecular weight, and may be branched or unbranched, and 
mixtures of such polymers may also be used. When the chemically modified NgR 
polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be 
selected for use. 

30 When the polymer is to be modified by an acylation reaction, the polymer 

should have a single reactive ester group. Alternatively, if the polymer is to be 
modified by reductive alkylation, the polymer should have a single reactive aldehyde 
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group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is 
water stable, or mono Cl-CIO alkoxy or aryloxy derivatives thereof (see U.S. Patent 
No. 5,252,714, incorporated by reference herein in its entirety). 

Pegylation of NgR polypeptides may be carried out by any of the pegylation 
5 reactions known in the art, as described, for example, in the following references: 

Focus on Growth Factors 3, 4-10 (1992); EP 0 154 316; and EP 0 401 384 (each of 
which is incorporated by reference herein in its entirety). Preferably, the pegylation is 
carried out via an acylation reaction or an alkylation reaction with a reactive 
polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A 
10 preferred water-soluble polymer for pegylation of polypeptides such as NgR is 
polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to 
encompass any of the forms of PEG that have been used to derivatize other proteins, 
such as mono (Cl-CIO) alkoxy- or aryloxy-polyethylene glycol. 

Chemical derivatization of NgR polypeptides may be performed under any 
15 suitable conditions used to react a biologically active substance with an activated 

polymer molecule. Methods for preparing pegylated NgR polypeptides will generally 
comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a 
reactive ester or aldehyde derivative of PEG, under conditions whereby NgR 
polypeptide becomes attached to one or more PEG groups, and (b) obtaining the 
20 reaction products. It will be apparent to one of ordinary skill in the art to select the 

optimal reaction conditions or the acylation reactions based on known parameters and 
the desired result. 

Pegylated and other polymer:NgR polypeptides may generally be used to treat 
conditions that may be alleviated or modulated by administration of the NgR 
25 polypeptides described herein. However, the chemically-derivatized polymer:NgR 
polypeptide molecules disclosed herein may have additional activities, enhanced or 
reduced biological activity, or other characteristics, such as increased or decreased 
half-life, as compared to the nonderivatized molecules. The NgR polypeptides, 
fragments thereof, variants and derivatives, may be employed alone, together, or in 
30 combination with other pharmaceutical compositions. The cytokines, growth factors, 
antibiotics, antiinflammatories and/or chemotherapeutic agents as is appropriate for the 
indication being treated. 
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The present invention provides compositions comprising purified polypeptides 
of the invention. Preferred compositions comprise, in addition to the polypeptide of 
the invention, a pharmaceutically acceptable (/'. e . , sterile and non-toxic) liquid, 
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium. 

5 Any diluent known in the art may be used. Exemplary diluents include, but are not 
limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium 
stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, 
dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter. 

Variants that display ligand binding properties of native NgR and are expressed 
10 at higher levels, as well as variants that provide for constitutively active receptors, are 
particularly useful in assays of the invention; the variants are also useful in assays of the 
invention and in providing cellular, tissue and animal models of diseases/conditions 
characterized by aberrant NgR activity. 

With the knowledge of the nucleotide sequence information disclosed in the 
15 present invention, one skilled in the art can identify and obtain nucleotide sequences 

which encode NgR from different sources (/. e . , different tissues or different organisms) 
through a variety of means well known to the skilled artisan and as disclosed by, for 
example, Sambrook et al, MOLECULAR CLONING: A LABORATORY MANUAL, Second 
Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), which is 
20 incorporated herein by reference in its entirety. 

For example, DNA that encodes NgR may be obtained by screening of mRNA, 
cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene 
sequence information provided herein. Probes may be labeled with a detectable group, 
such as a fluorescent group, a radioactive atom or a chemiluminescent group in 
25 accordance with procedures known to the skilled artisan and used in conventional 
hybridization assays, as described by, for example, Sambrook et al. (1989) above. 

A nucleic acid molecule comprising any of the NgR nucleotide sequences 
described above can alternatively be synthesized by use of the polymerase chain 
reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the 
30 nucleotide sequences provided herein. See U. S. Patent Nos. 4,683, 195 to Mullis et al. 
and 4,683,202 to Mullis. The PCR reaction provides a method for selectively 
increasing the concentration of a particular nucleic acid sequence even when that 
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sequence has not been previously purified and is present only in a single copy in a 
particular sample. The method can be used to amplify either single- or 
double-stranded DNA. The essence of the method involves the use of two 
oligonucleotide probes to serve as primers for the template-dependent, polymerase- 
5 mediated replication of a desired nucleic acid molecule. 

A wide variety of alternative cloning and in vitro amplification methodologies 
are well known to those skilled in the art. Examples of these techniques are found in, 
for example, Berger et al.. Guide to Molecular Cloning Techniques, METHODS IN 
ENZYMOLOGY 152 Academic Press, San Diego, CA, which is incorporated herein by 

10 reference in its entirety. 

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with certain disorders, as well as for genetic mapping. 

15 Antibodies 

Also comprehended by the present invention are antibodies (e.g., monoclonal 
and polyclonal antibodies, single chain antibodies, chimeric antibodies, 
bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and 
complementary determining region (CDR)-grafted antibodies, including compounds 
20 which include CDR sequences which specifically recognize a polypeptide of the 

invention) specific for NgR or fragments thereof. Preferred antibodies of the invention 
are human antibodies which are produced and identified according to methods 
described in W093/1 1236, published June 20, 1993, which is incorporated herein by 
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab') 2 , and F v , are 
25 also provided by the invention. The term "specific for," when used to describe 

antibodies of the invention, indicates that the variable regions of the antibodies of the 
invention recognize and bind NgR polypeptides exclusively (/. e . , are able to distinguish 
NgR polypeptides from other known NgR polypeptides by virtue of measurable 
differences in binding affinity, despite the possible existence of localized sequence 
30 identity, homology, or similarity between NgR and such polypeptides). 

The antigenic peptide of NgR comprises at least 8 amino acid residues of the 
amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of 
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NgR such that an antibody raised against the peptide forms a specific immune complex 
with NgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues, 
more preferably at least 15 amino acid residues, even more preferably at least 20 amino 
acid residues, and most preferably at least 30 amino acid residues. Preferred epitopes 
5 encompassed by the antigenic peptide are regions of NgR that are located on the 
surface of the protein, e.g., hydrophilic regions. 

It will be understood that specific antibodies may also interact with other 
proteins (for example, S. aureus protein A or other antibodies in ELISA techniques) 
through interactions with sequences outside the variable region of the antibodies, and, 
10 in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely 
practiced in the art. For a comprehensive discussion of such assays, see Harlow et al. 
in Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press; Cold 
Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments 
15 of the NgR polypeptides of the invention are also contemplated, provided that the 

antibodies are specific for NgR polypeptides. Antibodies of the invention can be 
produced using any method well known and routinely practiced in the art. 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by injection with the native 
20 protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, recombinantly expressed NgR 
protein or a chemically synthesized NgR polypeptide. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 
25 aluminum hydroxide), surface active substances (e.g. , lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as 
Bacille Calmette-Guerin and Corynebacterium parvum or similar immunostimulatory 
agents. If desired, the antibody molecules directed against NgR can be isolated from 
the mammal (e.g., from the blood) and further purified by well known techniques, such 
30 as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody” or "monoclonal antibody composition," as 
used herein, refers to a population of antibody molecules that contain only one species 
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of an antigen binding site capable of immunoreacting with a particular epitope of NgR. 
A monoclonal antibody composition thus typically displays a single binding affinity for 
a particular NgR protein with which it immunoreacts. For preparation of monoclonal 
antibodies directed towards a particular NgR protein, or derivatives, fragments, 

5 analogs or homologs thereof, any technique that provides for the production of 

antibody molecules by continuous cell line culture may be utilized. Such techniques 
include, but are not limited to, the hybridoma technique (see Kohler and Milstein 
(1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma 
technique (see Kozbor et al., (1983) Immunol. Today 4, 72) and the EBV hybridoma 
10 technique to produce human monoclonal antibodies (see Cole et al, (1985) in 

Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 
Human monoclonal antibodies may be utilized in the practice of the present invention 
and may be produced by using human hybridomas (see Cote et al., (1983) Proc. Natl. 
Acad. Sci. USA 80, 2026-2030) or by transforming human B-cells with Epstein Barr 
15 Virus in vitro (see Cole et al., (1985), above). 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to a NgR protein (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries 
(see e.g., Huse et al., (1989) Science 246, 1275-1281) to allow rapid and effective 
20 identification of monoclonal F ab fragments with the desired specificity for a NgR 

protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies 
can be "humanized" by techniques well known in the art. See e.g., U.S. Patent No. 
5,225,539. In one method, the non-human CDRs are inserted into a human antibody 
or consensus antibody framework sequence. Further changes can then be introduced 
25 into the antibody framework to modulate affinity or immunogenicity. Antibody 

fragments that contain the idiotypes to a NgR protein may be produced by techniques 
known in the art including, but not limited to: (i) an F(ab') 2 fragment produced by 
pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing 
the disulfide bridges of an F(ab') 2 fragment; (iii) an Fab fragment generated by the 
30 treatment of the antibody molecule with papain and a reducing agent and (iv) F v 
fragments. 
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Additionally, recombinant anti-NgR antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal antibodies can be 
5 produced by recombinant DNA techniques known in the art, for example using 

methods described in PCT International Application No. PCT/US86/02269; European 
Patent Application No. 184,187; European Patent Application No. 171,496; European 
Patent Application No. 173,494; PCT International Publication No. WO 86/01533; 
U.S. Pat. No. 4,816,567; European Patent Application No. 125,023; Better et al., 

10 (1988) Science 240, 1041-1043; Liu et al., (1987) Proc. Natl. Acad. Sci. USA 84, 

3439-3443; Liu et al., (1987) J. Immunol. 139, 3521-3526; Sun et al, (1987) Proc. 
Natl. Acad. Sci. USA 84, 214-218; Nishimura et al., (1987) Cancer Res. 47, 999-1005 
Wood et al., (1985 ) Nature 314, 446-449; Shaw et al,. (1988) J. Natl. Cancer Inst. 

80, 1553-1559); Morrison (1985) Science 229, 1202-1207; Oi etal., (1986) 

15 BioTechniques 4, 214; U.S. Patent. No. 5,225,539; Jones et al., (1986) Nature 321, 
552-525; Verhoeyan et al., (1988) Science 239, 1534; and Beidler et al., (1988) J. 
Immunol. 141, 4053-4060. 

In a preferred embodiment of the invention a portion of a NgR is joined to an 
Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion 
20 protein is soluble. The NgR/Fc fusion protein may be formed by recombinant 

techniques as described above. In one embodiment, a portion of a NgR including the 
entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused 
to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR 
and the Fc is also human. More preferably, the human Fc portion is derived from an 
25 IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such 
antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR. 

In one embodiment, methods for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay 
(ELISA) and other immunologically-mediated techniques known within the art. In a 
30 specific embodiment, selection of antibodies that are specific to a particular domain of 
a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a 
NgR protein possessing such a domain. Antibodies that are specific for one or more 
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domains within aNgR protein, e.g., domains spanning the above-identified conserved 
regions of NgRs, or derivatives, fragments analogs or homologs thereof, are also 
provided herein. 

Anti-NgR antibodies may be used in methods known within the art relating to 
5 the localization and/or quantitation of a NgR protein (e.g., for use in measuring levels 

of the NgR protein within appropriate physiological samples, for use in diagnostic 
methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies for NgR proteins, or derivatives, fragments analogs or homologs thereof, 
that contain the antibody derived binding domain, are utilized as pharmacologically- 
10 active compounds [hereinafter "Therapeutics"]. 

An anti-NgR antibody (e.g., monoclonal antibody) can be used to isolate NgR 
by standard techniques, such as affinity chromatography or immunoprecipitation. An 
anti-NgR antibody can facilitate the purification of natural NgR from cells and of 
recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody 
15 can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order 
to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR 
antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) 

20 the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials and radioactive materials. Examples of suitable enzymes 
include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 
25 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 

umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin and aequorin, and examples of suitable radioactive material include 
30 125 I, 131 I, 35 S or 3 H. 

Another aspect of the present invention is directed to methods of inducing an 
immune response in a mammal against a polypeptide of the invention by administering 
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to the mammal an amount of the polypeptide sufficient to induce an immune response. 
The amount will be dependent on the animal species, size of the animal, and the like 
but can be determined by those skilled in the art. 

Another aspect of the invention is directed to anti-idiotypic antibodies and 
5 anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes 
determinants of another antibody (a target antibody). Generally, the anti-idiotypic 
antibody recognizes determinants of the antigen-binding site of the target antibody. 
Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is 
generally prepared by immunizing an animal (particularly, mice) of the same species 
10 and genetic type as the source of the target monoclonal antibody, with the target 
monoclonal antibody. The immunized animal mounts an immune response to the 
idiotypic determinants of the target monoclonal antibody and produces antibodies 
against the idiotypic determinants of the target monoclonal antibody. 
Antibody-producing cells, such as splenic cells, of the immunized animal may be used 
15 to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic 
antibody may also be used to immunize animals to produce anti-anti-idiotypic 
antibodies. These immunized animals may be used to generate anti-anti-idiotypic 
monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies 
may bind to the same epitope as the original, target monoclonal antibody used to 
20 prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other 
monoclonal antibodies with the same antigen specificity as the original target 
monoclonal antibody. 

If the binding of the anti-idiotypic antibody with the target antibody is inhibited 
by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces 
25 an antibody response with the same specificity as the target antibody, it mimics the 
antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image 
anti-idiotype" and is capable of inducing an antibody response as if it were the original 
antigen. (Bona and Kohler (1984) ANTI-IDIOTYPIC ANTIBODIES AND INTERNAL IMAGE, 
in Monoclonal and anti-idiotypic antibodies: Probes for receptor structure 
30 AND function. Venter J.C. et al. (Eds), Alan R. Liss, New York, NY, pp 141-149, 
1984). Vaccines incorporating internal image anti-idiotype antibodies have been 
shown to induce protective responses against viruses, bacteria, and parasites (Kennedy 
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etal, (1986)232, 220-223; 1047; McNamara etal., (1985) Science 226, 1325-1326). 
Internal image anti-idiotypic antibodies have also been shown to induce immunity to 
tumor related antigens (Raychauhuri et al, (1986) J. Immunol. 137, 1743-1749; 
Raychauhuri et al., (1987) J. Immunol. 139, 3902-3910; Bhattacharya-Chatterjee et 
5 al., (1987) J. Immunol. 139, 1354-1360; Bhattacharya-Chatterjee et al., (1988) J. 
Immunol. 141, 1398-1403; Herlyn. et al. (1989) Intern. Rev. Immunol. 4, 347-357; 
Chen et al. (1990) Cell Imm. Immunother. Cancer 351-359; Herlyn et al., (1991) in 
vivo 5, 615-624; Furuya et al. (1992) AntiCancer Res. 12, 27-32; Mittelman, A. etal. 
(1992) Proc. Natl. Acad. Sci., USA 89, 466-470; Durrant. etal., (1994) Cancer Res. 
10 54, 4837-4840; Mittelman. et al. (1994) Cancer Res. 54, 415-421; Schmitt, etal. 

(1994) Hybridoma 13 , 389-396; Chakrobarty. etal. (1995 ) J. Immunother. 18, 
95-103; Chakrobarty. et al. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. etal. 
(1995) Clin. Cancer Res. 1, 1205-1294; Herlyn et al. (1995) Hybridoma 14, 159-166; 
Sclebusch et al. (1995) Hybridoma 14, 167-174; Herlyn. etal. (1996) Cancer 
15 Immunol Immunother. 43, 65-76). 

Anti-idiotypic antibodies for NgR may be prepared, for example, by 
immunizing an animal, such as a mouse, with a immunogenic amount of a composition 
comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic 
portion thereof, containing at least one antigenic epitope of NgR. The composition 
20 may also contain a suitable adjuvant, and any carrier necessary to provide 

immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the 
cells of the immunized animal as described above. A monoclonal antibody recognizing 
an epitope of NgR is then selected and used to prepare a composition comprising an 
immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 pg 
25 dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant. 

Animals may be immunized 2-6 times at 14 to 30 day intervals between doses. 
Typically, animals are immunized by any suitable route of administration, such as 
intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic 
antibody production may be monitored during the immunization period using standard 
30 immunoassay methods. Animals with suitable titers of antibodies reactive with the 

target monoclonal antibodies may be reimmunized with the monoclonal antibody used 
as the immunogen three days before harvesting the antibody producing cells. 
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Preferably, spleen cells are used, although other antibody producing cells may be 
selected. Antibody-producing cells are harvested and fused with myeloma cells to 
produce Hybridomas, as described above, and suitable anti-idiotypic 
antibody-producing cells are selected. 

5 Anti-anti-idiotypic antibodies are produced by another round of immunization 

and Hybridoma production by using the anti-idiotypic monoclonal antibody as the 
immunogen. 

Antibodies of the invention are useful for, e.g., therapeutic purposes (by 
modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and 
10 purification of NgR. Therefore, kits comprising an antibody of the invention for any of 
the purposes described herein are also comprehended. 

Kits 

The present invention is also directed to kits, including pharmaceutical kits. 

15 The kits can comprise any of the nucleic acid molecules described above, any of the 
polypeptides described above, or any antibody which binds to a polypeptide of the 
invention as described above, as well appropriate controls, such as positive and/or 
negative controls. The kit preferably comprises additional components, such as, for 
example, instructions, solid support, reagents helpful for quantification, and the like. 

20 For example, the kit can comprise: a labeled compound or agent capable of detecting 
NgR protein or mRNA in a biological sample; means for determining the amount of 
NgR in the sample; and means for comparing the amount of NgR in the sample with a 
standard. The compound or agent can be packaged in a suitable container. 

25 Screening Assays 

The DNA and amino acid sequence information provided by the present 
invention also makes possible identification of binding partner compounds with which 
a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner 
compounds include solution assays, in vitro assays wherein NgR polypeptides are 
30 immobilized and cell-based assays. Identification of binding partner compounds of 
NgR polypeptides provides candidates for therapeutic or prophylactic intervention in 
pathologies associated with NgR normal and aberrant biological activity. 
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The invention also provides a method (also referred to herein as a "screening 
assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules (e.g. , molecules of less than 1,000 Daltons) 
or other drugs) that bind to NgR proteins or have a stimulatory or inhibitory effect on, 

5 for example, NgR expression or NgR activity. 

In one embodiment, the invention provides assays for screening candidate or 
test compounds which bind to or modulate the activity of a NgR protein or 
polypeptide or biologically active portion thereof. The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 
10 library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 

15 non-peptide oligomer or small molecule libraries of compounds (Lam (1997) 
Anticancer Drug Des. 12, 145). 

Examples of methods for the synthesis of molecular libraries can be found in 
the art, for example in: DeWitt et al., (1993) Proc. Natl. Acad. Sci. USA 90, 6909; Erb 
et al., (1994 ) Proc. Natl. Acad. Sci. USA 91,11422; Zuckermann et al. (1994) J. Med. 
20 Chem 37, 2678; Cho et al., (1993) Science 261, 1303; Carrell etal., (1994) Angew 
Chem. Int. Ed. Engl. 33, 2059; Carell et al., (1994) Angew Chem. Int. Ed. Engl. 33, 
2061; and Gallop etal., (1994) J. Med. Chem 37, 1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
BioTechniques 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips 
25 (Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), 
spores (Ladner, above), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89, 
1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990) 
Science 249, 404-406; Cwirla etal. (1990) Proc. Natl. Acad. Sci. USA 87, 6378-6382; 
Felici (1991) J. Mol. Biol. 222, 301-310; Ladner, above). 


30 
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1. Cell-based Assays 

The invention also provides cell-based assays to identify binding partner 
compounds of a NgR polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a NgR polypeptide expressed on the surface 
5 of a cell with a candidate binding partner compound and detecting binding of the 

candidate binding partner compound to the NgR polypeptide. In another embodiment, 
an assay is a cell-based assay comprising contacting a cell expressing a membrane- 
bound form of NgR protein, or a biologically active portion thereof, on the cell surface 
with a test compound and determining the ability of the test compound to modulate 
10 (e.g., stimulate or inhibit) the activity of the NgR protein or biologically active portion 

thereof. 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of NgR protein, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 
1 5 compound to bind to a NgR protein determined. The cell, for example, can be of 

mammalian origin or a yeast cell. Determining the ability of the test compound to bind 
to the NgR protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or enzymatic label such that binding of the test compound to the 
NgR protein or biologically active portion thereof can be determined by detecting the 
20 labeled compound in a complex. For example, test compounds can be labeled with 
125 I, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test 
compounds can be enzymatically labeled with, for example, horseradish peroxidase, 
alkaline phosphatase or luciferase, and the enzymatic label detected by determination 
25 of conversion of an appropriate substrate to product. In one embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of NgR protein 
or a biologically active portion thereof, on the cell surface with a known compound 
which binds NgR to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with a NgR 
30 protein, wherein determining the ability of the test compound to interact with a NgR 

protein comprises determining the ability of the test compound to preferentially bind to 
NgR or a biologically active portion thereof as compared to the known compound. 
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Determining the ability of the test compound to modulate the activity of NgR 
or a biologically active portion thereof can be accomplished, for example, by 
determining the ability of the NgR protein to bind to or interact with a NgR target 
molecule. As used herein, a "target molecule" is a molecule with which a NgR protein 
5 binds or interacts in nature, for example, a molecule on the surface of a cell which 

expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a 
NgR protein or polypeptide of the present invention. In one embodiment, a NgR 
1 0 target molecule is a component of a signal transduction pathway that facilitates 

transduction of an extracellular signal (e.g., a signal generated by binding of a 
compound to a membrane-bound NgR molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has 
catalytic activity or a protein that facilitates the association of downstream signaling 
15 molecules with NgR. In a preferred embodiment, the detection comprises detecting a 
calcium flux or other physiological event in the cell caused by the binding of the 
molecule. 

Specific binding molecules, including natural ligands and synthetic compounds, 
can be identified or developed using isolated or recombinant NgR products, NgR 
20 variants, or preferably, cells expressing such products. Binding partners are useful for 
purifying NgR products and detection or quantification of NgR products in fluid and 
tissue samples using known immunological procedures. Binding molecules are also 
manifestly useful in modulating (/'. e . , blocking, inhibiting or stimulating) biological 
activities of NgR, especially those activities involved in signal transduction. 

25 

2. Cell-free Assays 

(a) Direct binding: 

The invention includes several assay systems for identifying NgR binding 
partners. In solution assays, methods of the invention comprise the steps of (a) 

30 contacting a NgR polypeptide with one or more candidate binding partner compounds 
and (b) identifying the compounds that bind to the NgR polypeptide. Identification of 
the compounds that bind the NgR polypeptide can be achieved by isolating the NgR 
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polypeptide/binding partner complex and separating the binding partner compound 
from the NgR polypeptide. An additional step of characterizing the physical, biological 
and/or biochemical properties of the binding partner compound is also comprehended 
in another embodiment of the invention. In one aspect, the NgR polypeptide/binding 
5 partner complex is isolated using an antibody immunospecific for either the NgR 
polypeptide or the candidate binding partner compound. 

In still other embodiments, either the NgR polypeptide or the candidate binding 
partner compound comprises a label or tag that facilitates its isolation, and methods of 
the invention to identify binding partner compounds include a step of isolating the NgR 
10 polypeptide/binding partner complex through interaction with the label or tag. An 
exemplary tag of this type is a poly-histidine sequence, generally around six histidine 
residues, that permits isolation of a compound so labeled using nickel chelation. Other 
labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, NY), well known 
and routinely used in the art, are embraced by the invention. 

15 

(b) Immobilized NgR 

In one variation of an in vitro assay, the invention provides a method 
comprising the steps of (a) contacting an immobilized NgR polypeptide, or a 
biologically active fragment thereof with a candidate binding partner compound and (b) 
20 detecting binding of the candidate compound to the NgR polypeptide. In an 

alternative embodiment, the candidate binding partner compound is immobilized and 
i binding of NgR is detected. Immobilization is accomplished using any of the methods 
well known in the art, including covalent bonding to a support, a bead or a 
chromatographic resin, as well as non-covalent, high affinity interactions such as 
25 antibody binding, or use of streptavidin/biotin binding wherein the immobilized 
compound includes a biotin moiety. Binding of a test compound to NgR, or 
interaction of NgR with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge 
30 tubes. In one embodiment, a fusion protein can be provided that adds a domain that 
allows one or both of the proteins to be bound to a matrix. For example, and not by 
way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be 
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adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or NgR 
protein, and the mixture is incubated under conditions conducive to complex formation 
5 (e.g-., at physiological conditions for salt and pH). Following incubation, the beads or 

microtiter plate wells are washed to remove any unbound components, the matrix 
immobilized in the case of beads, and the complexes determined either directly or 
indirectly, for example, as described above. Alternatively, the complexes can be 
dissociated from the matrix, and the level of NgR binding or activity determined using 
10 standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either NgR or its target molecule can 
be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
15 techniques well known in the art (e.g . , biotinylation kit. Pierce Chemicals, Rockford, 
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which 
do not interfere with binding of the NgR protein to its target molecule, can be 
derivatized to the wells of the plate, and unbound target or NgR trapped in the wells 
20 by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the NgR or target molecule, as well as 
enzyme-linked assays that rely on detecting an enzymatic activity associated with the 
NgR or target molecule. 

25 Detection of binding can be accomplished (i) using a radioactive label on the 

compound that is not immobilized, (ii) using of a fluorescent label on the 
non-immobilized compound, (iii) using an antibody immunospecific for the 
non-immobilized compound, (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, (v) 

30 determining the activity of the NgR, as well as other techniques well known and 
routinely practiced in the art. 
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Determining the activity of the target molecule, for example, may be 
accomplished by detecting induction of a cellular second messenger of the target (i.e. 
intracellular Ca 2+ , diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the 
target an appropriate substrate, detecting the induction of a reporter gene (comprising 
5 a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell 
survival, cellular differentiation, or cell proliferation. 

(c) Competition experiments 

10 In yet another embodiment, the assay comprises contacting the NgR protein or 

biologically active portion thereof with a known compound which binds NgR to form 
an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with a NgR protein, wherein determining 
the ability of the test compound to interact with a NgR protein comprises determining 
15 the ability of the test compound to preferentially bind to NgR or biologically active 
portion thereof as compared to the known compound. 

In yet another embodiment, the cell-free assay comprises contacting the NgR 
protein or biologically active portion thereof with a known compound which binds 
NgR to form an assay mixture, contacting the assay mixture with a test compound, and 
20 determining the ability of the test compound to interact with a NgR protein, wherein 

determining the ability of the test compound to interact with a NgR protein comprises 
determining the ability of the NgR protein to modulate the activity of a NgR target 
molecule. 

The cell-free assays of the present invention are amenable to use of both the 
25 soluble form or the membrane-bound form of NgR. In the case of cell-free assays 
comprising the membrane-bound form of NgR, it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of NgR is maintained in 
solution. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
30 decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®, 

Isotridecypoly(ethylene glycol ether) n , 3-(3-cholamidopropyl)dimethylamminiol-l- 
propane sulfonate (CHAPS), 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l- 
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propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-l -propane 
sulfonate. 

Modulators 

5 Agents that modulate (i.e., increase, decrease, or block) NgR activity or 

expression may be identified by incubating a putative modulator with a cell containing 
a NgR polypeptide or polynucleotide and determining the effect of the putative 
modulator on NgR activity or expression. The selectivity of a compound that 
modulates the activity of NgR can be evaluated by comparing its effects on NgR to its 
10 effect on other NgR compounds. Selective modulators may include, for example, 

antibodies and other proteins, peptides or organic molecules which specifically bind to 
a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will 
be therapeutically useful in treatment of diseases and physiological conditions in which 
normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and 
15 modulators may be used in the treatment of such diseases and conditions associated 

with demyelination. NgR polynucleotides and polypeptides, as well as NgR 
modulators, may also be used in diagnostic assays for such diseases or conditions. 

Methods of the invention to identify modulators include variations on any of 
the methods described above to identify binding partner compounds, the variations 
20 including techniques wherein a binding partner compound has been identified and the 
binding assay is carried out in the presence and absence of a candidate modulator. A 
modulator is identified in those instances where binding between the NgR polypeptide 
and the binding partner compound changes in the presence of the candidate modulator 
compared to binding in the absence of the candidate modulator compound. A 
25 modulator that increases binding between the NgR polypeptide and the binding partner 
compound is described as an enhancer or activator, and a modulator that decreases 
binding between the NgR polypeptide and the binding partner compound is described 
as an inhibitor. 

In another embodiment, modulators of NgR expression may be identified in a 
30 method wherein a cell is contacted with a candidate compound and the expression of 
NgR mRNA or protein in the cell is determined. The level of expression of NgR 
mRNA or protein in the presence of the candidate compound is compared to the level 
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of expression of NgR mRNA or protein in the absence of the candidate compound. 

The candidate compound can then be identified as a modulator of NgR expression 
based on this comparison. For example, when expression of NgR mRNA or protein is 
greater (statistically significantly greater) in the presence of the candidate compound 
5 than in its absence, the candidate compound is identified as a stimulator of NgR mRNA 
or protein expression. Alternatively, when expression of NgR mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein 
expression. The level of NgR mRNA or protein expression in the cells can be 
10 determined by methods described herein for detecting NgR mRNA or protein. 

High Throughput Screening 

The invention also comprehends high-throughput screening (HTS) assays to 
identify compounds that interact with or inhibit biological activity (/'. e . , affect 
15 enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit 
screening of large numbers of compounds in an efficient manner. Cell-based HTS 
systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays 
are designed to identify "hits" or "lead compounds" having the desired property, from 
which modifications can be designed to improve the desired property. Chemical 
20 modification of the "hit" or "lead compound" is often based on an identifiable 
structure/activity relationship between the "hit" and the NgR polypeptide. 

Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either NgR or nucleic acid molecules encoding NgR, 
comprising contacting NgR, or a nucleic acid molecule encoding the same, with a 
25 compound, and determining whether the compound binds NgR or a nucleic acid 

molecule encoding the same. Binding can be determined by binding assays which are 
well known to the skilled artisan, including, but not limited to, gel-shift assays. 

Western blots, radiolabeled competition assay, phage-based expression cloning, 
co-fractionation by chromatography, co-precipitation, cross linking, interaction 
30 trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, which are 

described in, for example, Ausubel et al. (Eds.), Current Protocols nsr Molecular 
Biology, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in 
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its entirety. The NgR proteins, for example, can be used as "bait proteins" in a two- 
hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,3 17; Zervos et al . , 
(1993) Cell 72, 223-232; Madura etal., (1993) J Biol. Chem. 268, 12046-12054; 
Bartel et al. , (1993) BioTechniques 14, 920-924; Iwabuchi et al., (1993) Oncogene 8, 

5 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact 

with NgR ("NgR-binding proteins" or "NgR-bp") and modulate NgR activity. Such 
NgR-binding proteins are also likely to be involved in the propagation of signals by the 
NgR proteins as, for example, upstream or downstream elements of the NgR pathway. 

Other assays may be used to identify specific ligands of a NgR receptor, 

10 including assays that identify ligands of the target protein through measuring direct 
binding of test ligands to the target protein, as well as assays that identify ligands of 
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC 
methods or other physical and analytical methods. Alternatively, such binding 
interactions are evaluated indirectly using the yeast two-hybrid system described in 
15 Fields et al., (1989) Nature 340, 245-246, and Fields et al., (1994) Trends Genet. 10, 
286-292, both of which are incorporated herein by reference. The two-hybrid system 
is a genetic assay based on the modular nature of most transcription factors used for 
detecting interactions between two proteins or polypeptides. It can be used to identify 
proteins that bind to a known protein of interest, or to delineate domains or residues 
20 critical for an interaction. Variations on this methodology have been developed to 
clone genes that encode DNA binding proteins, to identify peptides that bind to a 
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of 
interacting proteins to bring a transcription activation domain into close proximity with 
a DNA binding domain that binds to an upstream activation sequence (UAS) of a 
25 reporter gene, and is generally performed in yeast. The assay requires the construction 
of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein 
and (2) an activation domain fused to a second protein. The DNA-binding domain 
targets the first hybrid protein to the UAS of the reporter gene; however, because most 
proteins lack an activation domain, this DNA-binding hybrid protein does not activate 
30 transcription of the reporter gene. The second hybrid protein, which contains the 

activation domain, cannot by itself activate expression of the reporter gene because it 
does not bind the UAS. However, when both hybrid proteins are present, the 
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noncovalent interaction of the first and second proteins tethers the activation domain 
to the UAS, activating transcription of the reporter gene. For example, when the first 
protein is a NgR gene product, or fragment thereof, that is known to interact with 
another protein or nucleic acid, this assay can be used to detect agents that interfere 
5 with the binding interaction. Expression of the reporter gene is monitored as different 
test agents are added to the system. The presence of an inhibitory agent results in lack 
of a reporter signal. The compounds to be screened include (which may include 
compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the 
same), but are not limited to, extracellular, intracellular, biological or chemical origin. 
10 The function of the NgR gene product is unclear and no ligands have yet been 

found which bind the gene product. The yeast two-hybrid assay is useful to identify 
proteins that bind to the gene product. In an assay to identify proteins that bind to a 
NgR receptor, or fragment thereof, a fusion polynucleotide encoding both a NgR 
receptor (or fragment) and a UAS binding domain (/'. e . , a first protein) may be used. 

15 In addition, a large number of hybrid genes each encoding a different second protein 
fused to an activation domain are produced and screened in the assay. Typically, the 
second protein is encoded by one or more members of a total cDNA or genomic DNA 
fusion library, with each second protein-coding region being fused to the activation 
domain. This system is applicable to a wide variety of proteins, and it is not even 
20 necessary to know the identity or function of the second binding protein. The system 

. is highly sensitive and can detect interactions not revealed by other methods; even 
transient interactions may trigger transcription to produce a stable mRNA that can be 
repeatedly translated to yield the reporter protein. 

Other assays may be used to search for agents that bind to the target protein. 

25 One such screening method to identify direct binding of test ligands to a target protein 
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This 
method relies on the principle that proteins generally exist as a mixture of folded and 
unfolded states, and continually alternate between the two states. When a test ligand 
binds to the folded form of a target protein (/. e . , when the test ligand is a ligand of the 
30 target protein), the target protein molecule bound by the ligand remains in its folded 

state. Thus, the folded target protein is present to a greater extent in the presence of a 
test ligand which binds the target protein, than in the absence of a ligand. Binding of 
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the ligand to the target protein can be determined by any method which distinguishes 
between the folded and unfolded states of the target protein. The function of the 
target protein need not be known in order for this assay to be performed. Virtually any 
agent can be assessed by this method as a test ligand, including, but not limited to, 

5 metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small 
organic molecules. 

Another method for identifying ligands of a target protein is described in 
Wieboldt et al. (1997) Anal. Chem. 69:1683-1691, incorporated herein by reference. 
This technique screens combinatorial libraries of 20-30 agents at a time in solution 
10 phase for binding to the target protein. Agents that bind to the target protein are 
separated from other library components by simple membrane washing. The 
specifically selected molecules that are retained on the filter are subsequently liberated 
from the target protein and analyzed by HPLC and pneumatically assisted electrospray 
(ion spray) ionization mass spectroscopy. This procedure selects library components 
15 with the greatest affinity for the target protein, and is particularly useful for small 
molecule libraries. 

The methods of the invention also embrace ligands, especially neuropeptides, 
that are attached to a label, such as a radiolabel (e.g., 125 I, 35 S, 32 P, 33 P, 3 H), a 
fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic 
20 label. Modulators falling within the scope of the invention include, but are not limited 
to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric 
effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a 
test may either be free in solution, attached to a solid support, borne on a cell surface 
or located intracellularly or associated with a portion of a cell. One skilled in the art 
25 can, for example, measure the formation of complexes between NgR and the 
compound being tested. Alternatively, one skilled in the art can examine the 
diminution in complex formation between NgR and its substrate caused by the 
compound being tested. 

Another aspect of the present invention is directed to methods of identifying 
30 compounds which modulate (i. e . , increase or decrease) activity of NgR comprising 
contacting NgR with a compound, and determining whether the compound modifies 
activity of NgR. The activity in the presence of the test compared is measured to the 
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activity in the absence of the test compound. Where the activity of the sample 
containing the test compound is higher than the activity in the sample lacking the test 
compound, the compound will have increased activity. Similarly, where the activity of 
the sample containing the test compound is lower than the activity in the sample 
5 lacking the test compound, the compound will have inhibited activity. 

The present invention is particularly useful for screening compounds by using 
NgR in any of a variety of drug screening techniques. The compounds to be screened 
include (which may include compounds which are suspected to modulate NgR 
activity), but are not limited to, extracellular, intracellular, biologic or chemical origin. 
10 The NgR polypeptide employed in such a test may be in any form, preferably, free in 
solution, attached to a solid support, borne on a cell surface or located intracellularly. 
One skilled in the art can, for example, measure the formation of complexes between 
NgR and the compound being tested. Alternatively, one skilled in the art can examine 
the diminution in complex formation between Nogo-R and its substrate caused by the 
1 5 compound being tested. 

The activity of NgR polypeptides of the invention can be determined by, for 
example, examining the ability to bind or be activated by chemically synthesized 
peptide ligands. Alternatively, the activity of the NgR can be assayed by examining 
their ability to bind calcium ions, hormones, chemokines, neuropeptides, 

20 neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity 
of the NgR can be determined by examining the activity of effector molecules 
including, but not limited to, adenylate cyclase, phospholipases and ion channels. 

Thus, modulators of NgR activity may alter a NgR receptor function, such as a binding 
property of a receptor or an activity. In various embodiments of the method, the assay 
25 may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP 
assay such as a [ S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a 
diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLIPR assay for 
intracellular Ca concentration, a mitogenesis assay, a MAP Kinase activity assay, an 
arachidonic acid release assay {e.g . , using [ 3 H]-arachidonic acid) and an assay for 
30 extracellular acidification rates, as well as other binding or function-based assays of 
NgR activity that are generally known in the art. NgR activity can be determined by 
methodologies that are used to assay for FaRP activity, which is well known to those 
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skilled in the art. Biological activities of NgR receptors according to the invention 
include, but are not limited to, the binding of a natural or an unnatural ligand, as well 
as any one of the functional activities of NgRs known in the art. Non-limiting 
examples of NgR activities include transmembrane signaling of various forms, which 
5 may involve phosphatidylinositol (PI) association and/or the exertion of an influence 
over PI; another exemplary activity of NgRs is the binding of accessory proteins or 
polypeptides that differ from known GPI proteins. 

The modulators of the invention exhibit a variety of chemical structures, which 
can be generally grouped into non-peptide mimetics of natural NgR receptor ligands, 

10 peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may 
function as activators or inhibitors (competitive, uncompetitive and non-competitive) 
(e.g., antibody products) of NgR receptors. The invention does not restrict the 
sources for suitable modulators, which may be obtained from natural sources such as 
plant, animal or mineral extracts, or non-natural sources such as small molecule 
15 libraries, including the products of combinatorial chemical approaches to library 
construction, and peptide libraries. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
described in, for example. Enzyme Assays: A Practical Approach, Eisenthal and 
20 Danson (Eds.), 1992, Oxford University Press, which is incorporated herein by 
reference in its entirety. 

The use of cDNAs in drug discovery programs is well-known; assays capable 
of testing thousands of unknown compounds per day in high-throughput screens 
(HTSs) are thoroughly documented. The literature is replete with examples of the use 
25 of radiolabelled ligands in HTS binding assays for drug discovery (see Williams (1991) 

Med. Res. Rev., 11, 147-184; Sweetnam et al., (1993) J. Nat. Prod. 56, 441-455 for 
review). Recombinant receptors are preferred for binding assay HTS because they 
allow for better specificity (higher relative purity), provide the ability to generate large 
amounts of receptor material, and can be used in a broad variety of formats (see 
30 Hodgson (1992) Bio/T zchnology 10, 973-980; each of which is incorporated herein by 

reference in its entirety). 
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A variety of heterologous systems is available for functional expression of 
recombinant receptors that are well known to those skilled in the art. Such systems 
include bacteria (Strosberg et al. (1992) Trends Pharmacol. Sci. 13, 95-98), yeast 
(Pausch (1997) Trends Biotechnol. 15, 487-494), several kinds of insect cells (Vanden 
5 Broeck (1996) Int. Rev. Cytol. 164, 189-268), amphibian cells (Jayawickreme et al. 
(1997) Curr. Opin. Biotechnol. 8, 629-634) and several mammalian cell lines (CHO, 
HEK293, COS, etc.; see Gerhardt et al. (1997) Eur. J. Pharmacol. 334, 1-23). These 
examples do not preclude the use of other possible cell expression systems, including 
cell lines obtained from nematodes (PCT application WO 98/37177). 

10 In preferred embodiments of the invention, methods of screening for 

compounds which modulate NgR activity comprise contacting test compounds with 
NgR and assaying for the presence of a complex between the compound and NgR. In 
such assays, the ligand is typically labeled. After suitable incubation, free ligand is 
separated from that present in bound form, and the amount of free or uncomplexed 
15 label is a measure of the ability of the particular compound to bind to NgR. 

In another embodiment of the invention, high throughput screening for 
compounds having suitable binding affinity to NgR is employed. Briefly, large 
numbers of different small peptide test compounds are synthesized on a solid substrate. 
The peptide test compounds are contacted with NgR and washed. Bound NgR is then 
20 detected by methods well known in the art. Purified polypeptides of the invention can 
also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the protein 
and immobilize it on the solid support. 

Generally, an expressed NgR can be used for HTS binding assays in 
25 conjunction with its defined ligand. The identified peptide is labeled with a suitable 
radioisotope, including, but not limited to, I, H, S or P, by methods that are 
well known to those skilled in the art. Alternatively, the peptides may be labeled by 
well-known methods with a suitable fluorescent derivative (Baindur et al. (1994) Drug 
Dev. Res. 33, 373-398; Rogers (1997) Drug Discov. Today 2, 156-160). Radioactive 
30 ligand specifically bound to the receptor in membrane preparations made from the cell 
line expressing the recombinant protein can be detected in HTS assays in one of 
several standard ways, including filtration of the receptor-ligand complex to separate 
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bound ligand from unbound ligand (Williams (1991) Med. Res. Rev. 11 , 147-184; 
Sweetnam et al. (1993) J. Nat. Prod. 56, 441-455). Alternative methods include a 
scintillation proximity assay (SPA) or a FlashPlate format in which such separation is 
unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1 , 85-91 Bosse et al. 

5 (1998) J. Biomol. Screening 3, 285-292). Binding of fluorescent ligands can be 

detected in various ways, including fluorescence energy transfer (FRET), direct 
spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers (1997) Drug Discov. Today 2, 156-160; Hill (1998) Curr. Opin. Drug Disc. 
Dev. 1, 92-97). 

10 Examples of such biological responses include, but are not limited to, the 

following: the ability to survive in the absence of a limiting nutrient in specifically 
engineered yeast cells (Pausch (1997) Trends in Biotechnol. 15, 487-494); changes in 
intracellular Ca concentration as measured by fluorescent dyes (Murphy et al. (1998) 
Cur. Opin. Drug Disc. Dev. 1 , 192-199). Fluorescence changes can also be used to 
15 monitor ligand-induced changes in membrane potential or intracellular pH; an 

automated system suitable for HTS has been described for these purposes (Schroeder 
et al. (1996) J. Biomol. Screening 1, 75-80). Melanophores prepared from Xenopus 
laevis show a ligand-dependent change in pigment organization in response to 
heterologous NgR activation; this response is adaptable to HTS formats 
20 (Jayawickreme et al. (1997) Curr. Opin. Biotechnol. 8, 629-634). Assays are also 
available for the measurement of common second messengers, including cAMP, 
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS. 

Preferred methods of HTS employing these receptors include permanently 
transfected CHO cells, in which agonists and antagonists can be identified by the ability 
25 to transduce the signal for the binding of Nogo in membranes prepared from these cells 
through the putative GPI anchor. In another embodiment of the invention, 
permanently transfected CHO cells could be used for the preparation of membranes 
which contain significant amounts of the recombinant receptor proteins; these 
membrane preparations would then be used in receptor binding assays, employing the 
30 radiolabelled ligand specific for the particular receptor. Alternatively, a functional 

24 “ 

assay, such as fluorescent monitoring of ligand-induced changes in internal Ca 
concentration or membrane potential in permanently transfected CHO cells containing 
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each of these receptors individually or in combination would be preferred for HTS. 
Equally preferred would be an alternative type of mammalian cell, such as E0EK293 or 
COS cells, in similar formats. More preferred would be permanently transfected insect 
cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant 
5 yeast cells expressing the Drosophila melanogaster receptors in HTS formats well 
known to those skilled in the art (e.g. , Pausch (1997), above). 

The invention contemplates a multitude of assays to screen and identify 
inhibitors of ligand binding to NgR receptors. In one example, the NgR receptor is 
immobilized and interaction with a binding partner is assessed in the presence and 
10 absence of a candidate modulator such as an inhibitor compound. In another example, 
interaction between the NgR receptor and its binding partner is assessed in a solution 
assay, both in the presence and absence of a candidate inhibitor compound. In either 
assay, an inhibitor is identified as a compound that decreases binding between the NgR 
receptor and its binding partner. Another contemplated assay involves a variation of 
15 the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by 
detection of a positive signal in a transformed or transfected host cell, as described in 
PCT publication number WO 95/20652, published August 3, 1995. 

Candidate modulators contemplated by the invention include compounds 
selected from libraries of either potential activators or potential inhibitors. There are a 
20 number of different libraries used for the identification of small molecule modulators, 
including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
Chemical libraries consist of random chemical structures, some of which are analogs of 
known compounds or analogs of compounds that have been identified as "hits" or 
25 "leads" in other drug discovery screens, some of which are derived from natural 
products, and some of which arise from non-directed synthetic organic chemistry. 
Natural product libraries are collections of microorganisms, animals, plants, or marine 
organisms that are used to create mixtures for screening by: (1) fermentation and 
extraction of broths from soil, plant or marine microorganisms or (2) extraction of 
30 plants or marine organisms. Natural product libraries include polyketides, 

non-ribosomal peptides, and variants (non-naturally occurring) thereof. For a review, 
see Cane et al.. Science (1998) 282, 63-68. Combinatorial iibraries are composed of 
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large numbers of peptides, oligonucleotides, or organic compounds as a mixture. 

These libraries are relatively easy to prepare by traditional automated synthesis 
methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are 
non-peptide combinatorial libraries. Still other libraries of interest include peptide, 

5 protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and 

polypeptide libraries. For a review of combinatorial chemistry and libraries created 
therefrom, see Myers (1997) Curr. Opin. Biotechnol. 8, 701-707. Identification of 
modulators through use of the various libraries described herein permits modification 
of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate 
10 activity. 

Still other candidate inhibitors contemplated by the invention can be designed 
and include soluble forms of binding partners, as well as such binding partners as 
chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses 
non-peptide modulators, as well as such peptide modulators as neuropeptides other 
15 than natural ligands, antibodies, antibody fragments, and modified compounds 

comprising antibody domains that are immunospecific for the expression product of the 
identified NgR gene. 

Other embodiments of the invention comprise using competitive screening 
assays in which neutralizing antibodies capable of binding a polypeptide of the 
20 invention specifically compete with a test compound for binding to the polypeptide. In 
this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants with NgR. Radiolabeled competitive 
binding studies are described in Lin el al., (1997) Antimicrob. Agents Chemother. 41, 
2127-2131, the disclosure of which is incorporated herein by reference in its entirety. 

25 In other embodiments of the invention, the polypeptides of the invention are 

employed as a research tool for identification, characterization and purification of 
interacting, regulatory proteins. Appropriate labels are incorporated into the 
polypeptides of the invention by various methods known in the art and the 
polypeptides are used to capture interacting molecules. For example, molecules are 
30 incubated with the labeled polypeptides, washed to remove unbound polypeptides, and 
the polypeptide complex is quantified. Data obtained using different concentrations of 
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polypeptide are used to calculate values for the number, affinity, and association of 
polypeptide with the protein complex. 

Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, inhibitors. 

5 In one embodiment of affinity purification, a polypeptide is covalently coupled to a 

chromatography column. Cells and their membranes are extracted, and various cellular 
subcomponents are passed over the column. Molecules bind to the column by virtue 
of their affinity to the polypeptide. The polypeptide-complex is recovered from the 
column, dissociated and the recovered molecule is subjected to protein sequencing. 

10 This amino acid sequence is then used to identify the captured molecule or to design 
degenerate oligonucleotides for cloning the corresponding gene from an appropriate 
cDNA library. 

Alternatively, compounds may be identified which exhibit similar properties to 
the ligand for the NgR of the invention, but which are smaller and exhibit a longer half 
15 time than the endogenous ligand in a human or animal body. When an organic 
compound is designed, a molecule according to the invention is used as a "lead" 
compound. The design of mimetics to known pharmaceutically active compounds is a 
well-known approach in the development of pharmaceuticals based on such "lead" 
compounds. Mimetic design, synthesis and testing are generally used to avoid 
20 randomly screening a large number of molecules for a target property. Furthermore, 
structural data deriving from the analysis of the deduced amino acid sequences 
encoded by the DNAs of the present invention are useful to design new drugs, more 
specific and therefore with a higher pharmacological potency. 

Comparison of the protein sequence of the present invention with the 
25 sequences present in all the available databases showed a significant homology with the 
transmembrane portion of G protein coupled receptors. Accordingly, computer 
modeling can be used to develop a putative tertiary structure of the proteins of the 
invention based on the available information of the transmembrane domain of other 
proteins. Thus, novel ligands based on the predicted structure of NgR can be 
30 designed. 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 



WO 02/29059 


PCT/US01/31488 


- 88 - 

Compositions and Pharmaceutical Compositions 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in 
which case a composition or pharmaceutical composition can be prepared thereof for 
5 oral or parenteral administration. The compositions, or pharmaceutical compositions, 
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and 
compounds identified by the screening methods described herein, typically comprise 
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable 
carrier. As used herein, "pharmaceutically acceptable carrier" is intended to include 
10 any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 

isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The nature of the carrier or other ingredients will depend on the 
specific route of administration and particular embodiment of the invention to be 
administered. Examples of techniques and protocols that are useful in this context are, 
15 inter alia , found in Remington's PHARMACEUTICAL SCIENCES, 16th ed., (1980) Osol, 
A (Ed.), which is incorporated herein by reference in its entirety. Preferred examples 
of such carriers or diluents include, but are not limited to, water, saline. Ringer's 
solution, dextrose solution and 5% human serum albumin. Liposomes and 
non-aqueous vehicles such as fixed oils may also be used. The use of such media and 
20 agents for pharmaceutically active substances is well known in the art. Except insofar 
as any conventional media or agent is incompatible with the active compound, use 
thereof in the compositions is contemplated. Supplementary active compounds can 
also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
25 with its intended route of administration. Examples of routes of administration include 
oral and parenteral (e.g., intravenous, intradermal, subcutaneous, inhalation, 
transdermal (topical), transmucosal and rectal administration). Solutions or 
suspensions used for parenteral, intradermal or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 
30 fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
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acid; buffers such as acetates, citrates or phosphates, and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can 
be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
5 plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
10 water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). 

In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringeability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
15 example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
20 antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 

ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can 
be brought about by including in the composition an agent which delays absorption, for 
25 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in ah 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 
30 incorporating the active compound into a sterile vehicle that contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case 
of sterile powders for the preparation of sterile injectable solutions, methods of 
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preparation are vacuum drying and freeze-drying that yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. They 
5 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the 
fluid carrier is applied orally and swished and expectorated or swallowed. 

10 Pharmaceutically compatible binding agents, and/or adjuvant materials can be included 
as part of the composition. The tablets, pills, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant 
1 5 such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 

sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable 
20 propellant, e.g., a gas such as carbon dioxide or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. 

For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 
25 salts, and fusidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels or creams as generally 
known in the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
30 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 
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In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 

5 polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
10 acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1. It is 
especially advantageous to formulate oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 
15 treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved. 

20 The nucleic acid molecules of the invention can be inserted into vectors and 

used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by 
any of a number of routes, e.g., as described in U.S. Patent No. 5,703,055. Delivery 
can thus also include, e.g., intravenous injection, local administration (see U.S. Patent 
No. 5,328,470) or stereotactic injection (see e.g., Chen et al. (1994) Proc. Natl. Acad. 
25 Sci. USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector 
can include the gene therapy vector in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 
30 produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack or 
dispenser together with instructions for administration. 
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The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. 

For treating human or animals, between approximately 0.5 mg/kg of body weight to 
5 500 mg/kg of body weight of the compound can be administered. Therapy is typically 

administered at lower dosages and is continued until the desired therapeutic outcome is 
observed. 

Another aspect of the present invention is the use of the NgR nucleotide 
sequences disclosed herein for identifying homologs of the Nogo-R, in other animals, 

10 including but not limited to humans and other mammals and invertebrates. Any of the 
nucleotide sequences disclosed herein, or any portion thereof, can be used, for 
example, as probes to screen databases or nucleic acid libraries, such as, for example, 
genomic or cDNA libraries, to identify homologs using screening procedures well 
known to those skilled in the art. Accordingly, homologs having at least 50%, more 
15 preferably at least 60%, more preferably at least 70%, more preferably at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 100% homology with NgR sequences can be identified. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described 
20 herein, have a variety of pharmaceutical applications and may be used, for example, to 
treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In 
a particular embodiment, the present molecules are used in gene therapy. For a review 
of gene therapy procedures, see e.g. Anderson Science (1992) 256, 808-813, which is 
incorporated herein by reference in its entirety. 

25 The present invention also encompasses a method of agonizing (stimulating) or 

antagonizing a NgR natural binding partner associated activity in a mammal comprising 
administering to said mammal an agonist or antagonist to one of the above disclosed 
polypeptides in an amount sufficient to effect said agonism or antagonism. One 
embodiment of the present invention, then, is a method of treating diseases in a 
30 mammal with an agonist or antagonist of the protein of the present invention 
comprising administering the agonist or antagonist to a mammal in an amount 
sufficient to agonize or antagonize NgR-associated functions. 
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Methods of determining the dosages of compounds to be administered to a 
patient and modes of administering compounds to an organism are disclosed in U.S. 
Application Serial No. 08/702,282, filed August 23, 1996, and International patent 
publication number WO 96/22976, published August 1, 1996, both of which are 
5 incorporated herein by reference in their entirety, including any drawings, figures or 
tables. Those skilled in the art will appreciate that such descriptions are applicable to 
the present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used and the size and physiological condition 
10 of the patient. Therapeutically effective doses for the compounds described herein can 
be estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC 50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

15 Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

tumors and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the 
drug and the location of radiolabeled compounds can be determined using detection 
20 methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory 
activity in the screening assays, but have poor pharmacokinetic characteristics, can be 
optimized by altering the chemical structure and retesting. In this regard, compounds 
displaying good pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell 
25 composition. For example, toxicity studies can be carried out in a suitable animal 
model as follows: (1) the compound is administered to mice (an untreated control 
mouse should also be used); (2) blood samples are periodically obtained via the tail 
vein from one mouse in each treatment group; and (3) the samples are analyzed for red 
and white blood cell counts, blood cell composition and the percent of lymphocytes 
30 versus polymorphonuclear cells. A comparison of results for each dosing regime with 
the controls indicates if toxicity is present. 
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At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary 
Medical Association guidelines Report of the American Veterinary Medical Assoc. 
Panel on Euthanasia, (1993) J Am. Vet. Med. Assoc. 202:229-249). Representative 
5 animals from each treatment group can then be examined by gross necropsy for 

immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in 
tissue are noted and tissues are examined histologically. Compounds causing a 
reduction in body weight or blood components are less preferred, as are compounds 
having an adverse effect on major organs. In general, the greater the adverse effect the 
10 less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and 
most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided 
plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. 

15 Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

NgR mRNA transcripts have been found in the brain and heart. SEQ ID NOs: 

1 and/or, 3 will, as detailed above, enable screening the endogenous 
neurotransmitters/hormones/ligands which activate, agonize, or antagonize NgR and 
20 for compounds with potential utility in treating disorders including CNS disorders 

(e.g., stroke) and degenerative disorders such as those associated with demyelination. 

For example, NgR receptor activation may mediate the prevention of neurite 
outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See, 
e.g., Donovan et al., (1997) J. Neurosci. 17, 5316-5326; Turgeon et al., (1998) J. 

25 Neurosci. 18, 6882-6891; Smith-Swintosky et al., (1997) J! Neurochem. 69, 

1890-1896; Gill et al., (1998) Brain Res. 797, 321-327; Suidan et al., (1996) Semin. 
Thromb. Hemost. 22, 125-133. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NgR 
activity (e.g., NgR gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) 

5 disorders (e.g., a disease condition such as a demyelination disorder) associated with 
aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an individual's genotype and that individual's 
response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic 
10 failure by altering the relation between dose and blood concentration of the 

pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

15 Accordingly, the activity of NgR protein, expression of NgR nucleic acid or mutation 

content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 
20 persons. See e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol. 23, 983-985 and 
Linder (1997) Clin. Chem. 43, 254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body acts on drugs (altered drug 
25 metabolism). These pharmacogenetic conditions can occur either as rare defects or as 
polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency 
is a common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

30 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 
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(NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose 
of a drug. These polymorphisms are expressed in two phenotypes in the population, 

5 the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 
different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 
quite frequently experience exaggerated drug response and side effects when they 
10 receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6~formed metabolite morphine. At the other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
15 amplification. 

Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 
20 encoding drug-metabolizing enzymes to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied to dosing or drug selection, 
can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a NgR modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

25 

Monitoring Clinical Efficacy 

Monitoring the i nfl uence of agents (e.g., drugs, compounds) on the expression 
or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical 
30 trials. For example, the effectiveness of an agent determined by a screening assay as 
described herein to increase NgR gene expression, protein levels or upregulate NgR 
activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene 
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expression, protein levels, or downregulated NgR activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NgR gene 
expression, protein levels, or downregulate NgR activity, can be monitored in clinical 
trials of subjects exhibiting increased NgR gene expression, protein levels, or 
5 upregulated NgR activity. In such clinical trials, the expression or activity of NgR and, 
preferably, other genes that have been implicated in, for example, a disease or disorder, 
can be used as a "read out" or markers of the immune responsiveness of a particular 
cell. 

For example, genes, including NgR, that are modulated in cells by treatment 
10 with an agent (e.g., compound, drug or small molecule) that modulates NgR activity 
(e.g., identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on demyelination disorders, for example, in a clinical trial, 
cells can be isolated and RNA prepared and analyzed for the levels of expression of 
NgR and other genes implicated in the disorder. The levels of gene expression (/'. e . , a 
15 gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 

described herein, or alternatively by measuring the amount of protein produced by one 
of the methods as described herein or by measuring the levels of activity of NgR or 
other genes. In this way, the gene expression pattern can serve as a marker, indicative 
of the physiological response of the cells to the agent. Accordingly, this response state 
20 may be determined before, and at various points during, treatment of the individual 
with the agent. 

In one embodiment, the invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate 
25 identified by the screening assays described herein) comprising the steps of (/') 

obtaining a pre-administration sample from a subject prior to administration of the 
agent; (//) detecting the level of expression of a NgR protein, mRNA, or genomic 
DNA in the preadministration sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of expression or activity of the NgR 
30 protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the NgR protein, mRNA or genomic DNA in the 
pre-administration sample with the NgR protein, mRNA or genomic DNA in the post 
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administration sample or samples; and (vz) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of NgR to higher levels than detected, 
i. e . , to increase the effectiveness of the agent. Alternatively, decreased administration 
5 of the agent may be desirable to decrease expression or activity of NgR to lower levels 

than detected, i.e., to decrease the effectiveness of the agent. 

Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 
10 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 

associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 
15 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (z) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (zz) antibodies to a NgR 
peptide; (Hi) nucleic acids encoding a NgR peptide; (z'v) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 
20 insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators 
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 
25 between a NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 

30 Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 
bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 
5 include, but are not limited to, immunoassays {e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs {e.g.. Northern assays, dot blots, in situ hybridization, etc.). 

In one aspect, the invention provides a method for preventing, in a subject, a 
10 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 
15 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

20 Another aspect of the invention pertains to methods of modulating NgR 

expression or activity for therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
25 protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 

peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
or more NgR protein activity. Examples of such stimulatory agents include active 
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 
30 Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro {e.g., by 
culturing the cell with the agent) or, alternatively, in vivo {e.g., by administering the 
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agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 
5 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

10 Gene Therapy 

Mutations in the NgR gene that result in loss of normal function of the NgR 
gene product underlie NgR human disease states. The invention comprehends gene 
therapy to restore NgR activity to treat those disease states. Delivery of a functional 
NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, 
15 and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a 
retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or 
chemical treatments). See, for example, Anderson (1998) Nature, supplement to 
392(6679):25-20. For additional reviews of gene therapy technology see Friedmann 
(1989) Science 244, 1275-1281; Verma (1990) Sci. Am. 68-84; and Miller (1992) 

20 Nature 357, 455-460. Alternatively, it is contemplated that in other human disease 

states, preventing the expression of, or inhibiting the activity of, NgR will be useful in 
.'treating disease states. It is contemplated that antisense therapy or gene therapy could 
be applied to negatively regulate the expression of NgR. 

The present invention provides for both prophylactic and therapeutic methods 
25 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 

associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 
30 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (ii) antibodies to a NgR 
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peptide; (iif) nucleic acids encoding a NgR peptide; (iv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 
insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
5 recombination (see, e.g., Capecchi (1989) , above); or (v) modulators (i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or 
antibodies specific to a peptide of the invention) that alter the interaction between a 
NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 
10 not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 
15 bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 
20 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g.. Northern assays, dot blots, in situ hybridization, etc.). 

In one aspect, the invention provides a method for preventing, in a subject, a 
25 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 
30 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
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NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

Another aspect of the invention pertains to methods of modulating NgR 
expression or activity for therapeutic purposes. The modulatory method of the 
5 invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
10 or more NgR protein activity. Examples of such stimulatory agents include active 

NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by 
15 culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 
20 herein), or combination of agents that modulates (e.g. , upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

The present invention is not to be limited in scope by the specific embodiments 
25 described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying figure. Such modifications are intended to fall within 
the scope of the appended claims. 

The following Table 5 contains the sequences of exemplary polynucleotides 
30 and polypeptides of the invention. 
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TABLE 5 


The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: 


ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCTCCTGCTGATG 

CTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGCTCTGCACCTGCTACTCATCC 

CCGCCCACCGTGAGCTGCCAGGCCAACAACTTCTCCTCTGTGCCGCTGTCCCTGCCACCC 

AGCACTCAGCGACTCTTCCTGCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCTTT 

GGGTCCAACCTGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC 

ACTTTCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCGGCACCTGCGC 

TCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGCAGTCGCTGCATTTGTACCGC 

TGCCAGCTCAGCAGCCTGCCCGGCAACATCTTCCGAGGCCTGGTCAGCCTGCAGTACCTC 

TACCTCCAGGAGAACAGCCTGCTCCACCTACAGGATGACTTGTTCGCGGACCTGGCCAAC 

CTGAGCCACCTCTTCCTCCACGGGAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC 

GGCCTGGGCAGCCTGGACCGGCTGCTGCTGCACGGGAACCGGCTGCAGGGCGTGCACCGC 

GCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACCTGTTCAACAACAGCCTGGCC 

TCGCTGCCCGGCGAGGCGCTCGCCGACCTGCCCTCGCTCGAGTTCCTGCGGCTCAACGCT 

AACCCCTGGGCGTGCGACTGCCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGC 

GTGTCCAGCTCCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC 

GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCGGCCGGGCAGC 

CGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGGTGGCCGAGGCCGGGGCGCCC 

CCAGCCGATCCCTCCACCCTCTACCGAGATCTGCCTGCCGAAGACTCGCGGGGGCGCCAG 

GGCGGGGACGCGCCTACTGAGGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGA 

GGGGAGCAGATGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG 

CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCCCCACCACCTC 


The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid 

sequence derived from the DNA sequence of SEQ ID NO. 1 : 

MLPGLRRLLQAPASACLLLMLLALPLAAPSC 
PMLCTCYSSPPTVSCQANNFSSVPLSLPPST 
QRLFLQNNLIRTLRPGTFGSNLLTLWLFSNN 
LSTIYPGTFRHLQALEELDLGDNRHLRSLEP 
DTFQGLERLQSLHLYRCQL S SLP GNIFRGLV 
SLQYLYLQENSLLHLQDDLFADLANLSHLFL 
HGNRLRLLTEHVFRGLGSLDRLLLHGNRLQG 
VHRAAFRGLSRLTILYLFNNSLASLPGEALA 
DLPSLEFLRLNANPWACDCRARPLWAWFQRA 
RVSSSDVTCATPPERQGRDLRALREADFQAC 
PPAAPTRPGSRARGNSSSNHLY GVAEAGAPP 
ADPSTLYRDLPAED SRGRQGGDAPTEDDYWG 
GYGGEDQRGEQMCPGAACQAPPDSRGPALSA 
GLP SPLLCLLLLVPHHL 
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The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse: 

ATGTCTTGGCAGTCTGGAACCACAGTGACACAATCTCCCGTGCAGGCTGCTCAGGTCTCA 

GGGTGCTGTGTGGAATTGCTGCTGTTGCTGCTCGCTGGAGAGCTACCTCTGGGTGGTGGT 

TGTCCTCGAGACTGTGTGTGCTACCCTGCGCCCATGACTGTCAGCTGCCAGGCACACAAC 

TTTGCTGCCATCCCGGAGGGCATCCCAGAGGACAGTGAGCGCATCTTCCTGCAGAACAAT 

CGCATCACCTTCCTCCAGCAGGGCCACTTCAGCCCCGCCATGGTCACCCTCTGGATCTAC 

TCCAACAACATCACTTTCATTGCTCCCAACACCTTCGAGGGCTTTGTGCATCTGGAGGAG 

CTAGACCTTGGAGACAACCGACAGCTGCGAACGCTGGCACCCGAGACCTTCCAAGGCCTG 

GTGAAGCTTCACGCCCTCTACCTCTATAAGTGTGGACTGAGCGCCCTGCCCGCAGGCATC 

TTTGGTGGCCTGCACAGCCTGCAGTATCTCTACTTGCAGGACAACCATATCGAGTACCTC 

CAAGATGACATCTTTGTGGACCTGGTCAATCTCAGTCACTTGTTTCTCCATGGTAACAAG 

CTATGGAGCCTGGGCCAAGGCATCTTCCGGGGCCTGGTGAACCTGGACCGGTTGCTGCTG 

CATGAGAACCAGCTACAGTGGGTTCACCACAAGGCTTTCCATGACCTCCACAGGCTAACC 

ACCCTCTTTCTCTTCAACAACAGCCTCACTGAGCTGCAGGGTGACTGTCTGGCCCCCCTG 

GTGGCCTTGGAGTTCCTTCGCCTCAATGGGAATGCTTGGGACTGTGGCTGCCGGGCACGT 

TCCCTGTGGGAATGGCTGCGAAGGTTCCGTGGCTCTAGCTCTGCTGTCCCCTGCGCGACC 

CCCGAGCTGCGGCAAGGCCAGGATCTGAAGCTGCTGAGGGTGGAGGACTTCCGGAACTGC 

ACAGGACCAGTGTCTCCTCACCAGATCAAGTCTCACACGCTTACCACCTCTGACAGGGCT 

GCCCGCAAGGAGCACCATCCGTCCCATGGGGCCTCCAGGGACAAAGGCCACCCACATGGC 

CATCCGCCTGGCTCCAGGTCAGGTTACAAGAAGGCAGGCAAGAACTGCACCAGCCACAGG 

AACCGGAACCAGATCTCTAAGGTGAGCTCTGGGAAAGAGCTTACCGAACTGCAGGACTAT 

GCCCCCGACTATCAGCACAAGTTCAGCTTTGACATCATGCCCACCGCACGACCCAAGAGG 

AAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCAGTGGGGTGCAGCAGGCATCC 

TCAGGCACGGCCCTTGGGGCCCCACTCCTGGCCTGGATACTGGGGCTGGCAGTCACTCTC 

CGC 


The following protein sequence <SEQ ID NO. 4> is deduced protein of SEQ ID 
NO:3: 

MSWQSGTTVTQSPVQAAQVSGCCVELLLLLL 
AGELPLGGGCPRDCVCYPAPMTVSCQAHNFA 
AIPEGIPEDSERIFLQNNRITFLQQGHFSPA 
MVTLWIY SNNITFIAPNTFEGFVHLEELDLG 
DNRQLRTLAPETFQGLVKLHALYLYKCGLSA 
LPAGIFGGLHSLQYLYLQDNHIEYLQDDIFV 
DLVNLSHLFLHGNKLWSLGQGIFRGLVNLDR 
LLLHENQLQWVHHKAFHDLHRLTTLFLFNNS 
LTELQGDCLAPLVALEFLRLNGNAWDCGCRA 
RSLWEWLRRFRGSSSAVPCATPELRQGQDLK 
LLRVEDFRNCTGPVSPHQIKSHTLTTSDRAA 
RKEHHPSHGASRDKGHPHGHPPGSRSGYKKA 
GKNCTSHRNRNQISKVSSGKELTELQDYAPD 
YQHKFSFDIMPT ARPKRKGKC ARRTPIRAPS 
GVQQASSGTALGAPLLAWILGLAVTLR 
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The following protein sequence <SEQ ID NO. 5> is NgRl from humans: 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGA 

C 

CYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVL 
ARID AAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAA 
LQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHV 
HPHAFRDLGRLMTLYLFANNLS ALPTEALAP 

lralqylrlndnpwvcdcrarplwawlqk.fr 

GSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 

VATGPYHPIWTGRATDEEPLGLPKCCQPDAA 

DKASVLEPGRPASAGNALKGRVPPGDSPPGN 

GSGPRHINDSPFGTLPGSAEPPLTAVRPEGS 

EPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 

GSGGGGTGDSEGSGALPSLTCSLTPLGLALV 

LWTVLGPC 

The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of 
NgR based on homology with NgRl 

CPXXCXCYXXPXXTXSCXXXXXXXXPX 
XXPXXXXRXFLXXNXIXXXXXXXFXXXXXXXXLWX 
XSNXXXXIXXXXFXXXXXLEXLDLXDNXXLR 
XXXPXTFXGLXXLXLXLXXCXLXXLXXXXFX 
GLXXLQYLYLQXNXXXXLXDDXFXDLXNLXH 
LFLHGNXXXXXXXXXFRGLXXLDRLLLHXNX 
XXXVHXXAFXXLXRLXXLXLFXNXLXXLXXX 
XLAXLXXLXXLRLNXNXWXCXCRARXLWXWX 
XXXRXSSSXVXCXXPXXXXGXDLXXLXXXDX 
XXCXXXXXPXXPXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 
PPXXXSXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXRXXX 
XXXXXXXXXXXXXXXXXXXXXXXXLXXXXX 
XX X XX L 


The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain 
ofNogo: 

RIYKGVIQAIQKSDEGHPFRAYLESEVAISE 
ELVQKYSNSALGHVNCTIKELRRLFLVDDLV 
D S L K 
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The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the 
mature NgR2: 

CPMLCTCYSSPPTVSCQANNFSSVPLSLPPS 
TQRLFLQNNLIRTLRPGTF GSNLLTLWLFSN 
NLSTIYPGTFRHLQALEELDLGDNRHLRSLE 
PDTFQGLERLQSLHLYRCQLSSLPGNIFRGL 
VSLQYLYLQENSLLHLQDDLFADLANLSHLF 
LHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
GVHRAAFRGLSRLTILYLFNN SLASLPGEAL 
ADLPSLEFLRLNANPWACDCRARPLWAWFQR 
ARVSSSDVTCATPPERQGRDLRALREADFQA 
CPPAAPTRPGSRARGNSSSNHLYGVAEAGAP 
PADPSTLYRDLPAEDSRGRQGGDAPTEDDYW 
GGYGGEDQRGEQMCPGAACQAPPD SRGPALS 
AGLPSPLLCLLLL VP H H L 


The following protein sequence <SEQ ID NO:9> is the amino acid sequence of the 
mature NgR3 : 

CPRDCVCYPAPMTVSCQAHNFAAIPEGIPED 
SERIFLQNNRITFLQQGHFSPAMVTLWIYSN 
NITFIAPNTFEGFVHLEELDLGDNRQLRTLA 
PETFQGLVKLHALYLYKCGLSALPAGIFGGL 
HSLQYLYLQDNHIEYLQDDIFVDLVNLSHLF 
LHGNKLWSLGQGIFRGLVNLDRLLLHENQLQ 
WVHHKAFHDLHRLTTLFLFNNSLTELQGDCL 
APLVALEFLRLNGNAWDCGCRARSLWEWLRR 
FRGSSSAVPCATPELRQGQDLKLLRVEDFRN 
CTGPVSPHQIKSHTLTTSDRAARKEHHPSHG 
ASRDKGHPHGHPPGSRSGYKKAGKNCTSHRN 
RNQISKVSSGKELTELQDYAPDYQHKFSFDI 
MPTARPKRKGKCARRTPIRAPSGVQQASSGT 
A L G A PL L AW I L G L A VT LR 


The following amino acid sequence <SEQ ID NO:10> is a conserved cysteine motif 
(Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence: 
CPXXCXCYXXPXXTXSC 
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Unless otherwise indicated, X is any amino acid. For example, X where 
indicated may be no amino acid. Additional features of the invention will be apparent 
from the following Examples. Examples 1-5 are actual, while the remaining Examples 
20 are prophetic. 

As shown by the following Examples, a gene encoding novel NgRs have been 
identified by computational analysis of DNA sequence data. The proteins encoded by 
NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in 
a conserved leucine-rich region (SEQ ID NO: 12), a conserved cysteine-rich region 
25 (SEQ ID NO: 10) N-terminal to the leucine-rich region, a second cysteine-rich domain 
(SEQ ID NO: 1 1) C-terminal to the leucine-rich region, and a putative 
glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the 
previously identified NgR sequence. The NgR homologs, when compared to known 
NgRs, show a consensus sequence (SEQ ID NOs:6). The putative mature NgR2 and 
30 NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively. 

Example 1: Tblastn query of the HTG database 

The protein sequence for the human NgR (NgRl) (SEQ ID NO: 5) was used to 
query the high throughput genomic (HTG) database the use of which is familiar to 
35 those skilled in the art. The HTG database is a part of GenBank, a comprehensive 


The following amino acid sequence <SEQ ID NO: 1 1> is a conserved cysteine motif 

(Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence: 

NXWXCXCRARXLWXWXXXXRXS S SXVXCXXP 
XXX XGXDLXXLXXXDXXXC 


The following amino acid sequence <SEQ ID NO:12> is a conserved Leucine-rich 

domain of the NgR and homologs based on the Consensus Sequence: 

RXFLXXNXIXXXXXXXFXXXXXXXXLWXXSN 
XXXXIXXXXFXXXXXLEXLDLXDNXXLRXXX 
PXTFXGLXXLXLXLXXCXLXXLXXXXFXGLX 
XLQYLYLQXNXXXXLXDDXFXDLXNLXHLFL 
HGNXXXXXXXXXFRGLXXLDRLLLHXNXXXX 
VHXXAFXXLXRLXXLXLFXNXLXXLXXXXL A 
X L XX L XX LR L 
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NIH genetic sequence database, which includes an annotated collection of all publicly 
available DNA sequences {Nucleic Acids Res. (2000) 28, 15-8). The HTG database 
includes sequences obtained from genomic DNA. Within genomic DNA, genes are 
typically encoded by multiple segments of DNA called exons. Thus when one aligns a 
5 cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic 
sequence, the sequence will be broken up into segments depending on the number of 
exons in the gene. 

The BLAST algorithm, which stands for Basic Local Alignment Search Tool is 
suitable for determining sequence similarity (Altschul et al, (1990) J. Mol. Biol. 215, 
10 403-410, which is incorporated herein by reference in its entirety). Software for 

performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 
15 positive- valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating 
searches to find HSPs containing them. The word hits are extended in both directions 
along each sequence for as far as the cumulative alignment score can be increased. 

20 Extension for the word hits in each direction are halted when: 1) the cumulative 

alignment score falls off by the quantity X from its maximum achieved value; 2) the 
cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or 3) the end of either sequence is reached. The 
Blast algorithm parameters W, T and X determine the sensitivity and speed of the 
25 alignment. The Blast program uses as defaults a word length (W) of 1 1, the 

BLOSUM62 scoring matrix (see Henikoff et al., (1992) Proc. Natl. Acad. Sci. USA 
89, 10915-10919, which is incorporated herein by reference in its entirety) alignments 
(B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm (Karlin et al., (1993) Proc. Natl. Acad. Sci. USA 90, 

30 5873-5787, which is incorporated herein by reference) and Gapped BLAST perform a 

statistical analysis of the similarity between two sequences. One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N)), which 
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provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a NgR gene or cDNA if the smallest sum probability in 
comparison of the test nucleic acid to a NgR nucleic acid is less than about 1, 

5 preferably less than about 0. 1, more preferably less than about 0.01, and most 
preferably less than about 0.001. 

To query the HTG database with the NgR protein sequence, we used a 
variation of the BLAST algorithm known as the tblastn program, which compares a 
protein query sequence against a nucleotide sequence database dynamically translated 
10 in all reading frames (J. Mol. Biol. (1990) 215, 403-410: Nucleic Acids Res. (1997) 

25, 3389-3402). The results of the tblastn search indicated the presence of genes in 
the database with a significant identity to the NgR. In addition to finding hits to 
genomic clones which contain the human and mouse NgR genes, we found hits to 
clones where the identity was not as high, but still very significant. Three human 
15 clones were found (Accession numbers: AC068514, AC016869, AC013606) with an 
e- value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an 
e- value of le-78. The three human clones all appeared to encode the same gene, so 
further analysis was confined to AC013606. 

20 Example 2: Prediction of the human NgR2 protein sequence (AC013606) 

The human NgR protein sequence aligned with two regions of translated 
sequence from nucleotide sequence AC013606, indicating that the new gene was 
encoded by at least two exons. In order to define the complete gene, we used the 
computer program GENSCAN™ (J. Mol. Biol. (1997) 268, 78-94) which can identify 
25 complete exon/intron structures of genes in genomic DNA. The gene prediction by 
GENES CAN™ contained seven exons. By comparing these predicted exons to the 
NgR, it was concluded that the new human gene contains two of these exons and a 
part of another (containing the initiating methionine). The predicted cDNA (mRNA) 
encoded by these three exons was assembled from AC0 13 606 (HTG1 1; deposited 
30 March 2000; length = 143899; GenBank release 1 18.0; SEQ ID NO: 15) by combining 
nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1); 
130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA 
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sequence is SEQ ID NO:l (nucleotide sequence of human NgR2; AC013606). The 
translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID 
NO:2). 

We used the protein sequence of human NgR2 as a query sequence against the 
5 human EST database. A number of hits of high significance were found indicating that 
the NgR2 mRNA is expressed in a number of tissues including fetal brain. 

Furthermore, two of these ESTs provided support for the exon structure that we 
deduced. One EST (Accession No: GB_EST19:AI346757) contains 565 nucleotides 
corresponding to amino acids 84-271 of the human NgR2 (SEQ ID No:4). This spans 
10 the second intron located between amino acids 171 and 172, and provides positive 
evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST 
(GB_EST26:AI929019) contains 545 nucleotides, part of which corresponds to amino 
acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located 
between amino acids 10 and 11, and provides positive evidence for the splicing of 
15 exons 1 and 2 at the mRNA level. 

Example 3: Prediction of the mouse NgR3 protein sequence (AC021768) 

The human NgR protein sequence aligned with only one region of translated 
sequence from nucleotide sequence AC021768, indicating that most of the new mouse 
20 gene was encoded by one large exon. However, upon inspection, the protein encoded 
by this exon was missing an initiating methionine. In order to define the complete 
gene, we used the computer program GENSCAN as described above. The gene 
prediction by GENSCAN contained two exons; the large one found by visual 
inspection and a short one at the 5' end which provided an initiating methionine. The 
25 predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768 
(HTG14; deposited March 2000; length = 215980; GenBank release 118.0; SEQ ID 
NO: 16) by combining nucleotides from the two exons whose coordinates are: the 
complement of 164265-164325 (exon 1); and the complement of 155671-156992 
(exon 2). The sequence for this cDNA sequence is SEQ ID NO:3 (nucleotide 
30 sequence of mouse NgR3; AC021768). The translation of this cDNA provides the 
protein sequence of mouse NgR3 (SEQ ID NO:4). 
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We used the protein sequence of mouse NgR3 as a query sequence against the 
mouse EST database. One hit of high significance was found indicating that the NgR2 
mRNA is expressed in the heart. This EST (GB_EST20:AI428334) contains 463 
nucleotides, part of which correspond to amino acids 45-193 of mouse NgR3 (SEQ ID 
5 NO:4). 

Example 4: Similarity between the NgRs 

An alignment between NgRl and the two new receptors is shown in Fig. 

1 A-1B. The similarities between these proteins include: 

10 (1) The SignalP program, which locates the signal sequence cleavage 

position, predicts a cleavage before the first conserved cysteine in all the proteins. 

Thus the mature protein in all cases will have a cysteine at the N-terminus. 

(2) All proteins contain eight Leucine Rich Repeats (ERR). LRRs are short 
sequence motifs present in a number of proteins with diverse functions and cellular 

1 5 locations. These repeats are usually involved in protein-protein interactions. Each 
LRR is composed of a beta-alpha unit. 

(3) All three proteins contain a leucine rich repeat N-terminal domain 
(LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine 
rich domains at both their N and C termini. 

20 (4) All three proteins contain a LRR C-terminal domain (LRRCT). The 

LRRCTs of the three NgR proteins can be distinguished from those of other LRR 
containing proteins, by the pattern of typtophans and cysteines which are completely 
conserved in this domain. 

(5) All three proteins contain a conserved cysteine in the fourth LRR 

25 domain. 

(6) All three proteins contain a conserved potential glycosylation site in the 
eighth LRR domain. 

(7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NgRl, an 
indication that they probably also undergo a modification similar to NgRl, where a 

30 GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to 
remain tethered to the cell. 
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Example 5: Preparation of Nogo Proteins 

A Nogo binding assay was developed which utilizes a method widely used in 
examining semaphorin and ephrin axonal guidance function (Flanagan & 
Vanderhaeghen (1998) Annu. Rev. Neurosci. 21,3 09-345; Takahashi et al., (1999) 

5 Cell 99, 59-69). It involves fusing a secreted placental alkaline phosphatase (AP) 

moiety to the ligand in question to provide a biologically active receptor binding agent 
which can be detected with an extremely sensitive colorimetric assay. For Nogo, an 
expression vector is created encoding a signal peptide, a His6 tag for purification, AP, 
and the 66 amino acid active domain of Nogo. The fusion protein can be purified from 
10 the conditioned medium of transfected cells in milligram amounts. This protein is 
biologically active as a growth cone collapsing agent with an EC 50 of 1 nM. 

Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein 
may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is 
created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo. 
15 GST-Nogo may be purified from the culture medium and used as a GST fusion 

protein, or GST may be cleaved from the Nogo portion of the fusion protein with an 
enzyme that recognizes the specific amino acid cleavage sit engineered between the 
GST portion and the Nogo portion of the fusion protein. Such sites are part of the 
commercially available GST vectors. The specific cleavage sites and enzymes may be 
20 used in accordance with the Manufacturer's specifications. 

It has been found that AP-Nogo is actually slightly more potent than 
GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a 
prokaryotic cell. 

Binding of Nogo to immobilized NgR homologs may be performed in an 
25 ELISA-type assay in which AP-Nogo is allowed to react with an immobilized receptor 
homolog. Specificity of binding may be demonstrated in a competitive binding assay 
using increasing amounts of GST-Nogo in the type of assay to show a decreasing 
amount of binding of AP-Nogo (as judged in the colorimetric assay). 

30 Example 6: Transfected COS Cell binding Assays 

The homologs of the present invention may be used in transfection studies in 
COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences 
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encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector. 
Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS 
cells with nucleic acid sequences encoding NgRs will make them capable of binding 
Nogo. AP alone does not bind with any stable affinity to these transfected cells, 

5 indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino 
acids derived from Nogo. Furthemore, specific affinity of Nogo for the NgR2 or 
NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo. 
NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using 
this assay. 

10 

Example 7: Expression of NgR in Human Cell Lines using Northern Blot and a 
Random-Primed Probe 

A Northern blot is purchased from a commercial source, or RNA samples from 
cells of interest are run on an agarose gel and blotted to a membrane using any of the 
1 5 well known techniques for Northern blotting. The blot is probed with a fragment of 

NgR2 (SEQ ID NO: 1) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng 
of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal. 
Biochem. 132, 6-13). Hybridization is carried out at 68°C for 1 hour in 
ExpressHybTM solution (Clontech, Cat. No. 8015-1) followed by washing with 2X 
20 SSC/0.05% SDS at room temperature and two washes with 0.1X SSC/0.1% SDS at 

50° C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an 
appropriately sized band on the blot. 

Example 8: Cloning of cDNA corresponding to NgRs 

25 

To obtain the full-length clone corresponding to NgR2 from a cDNA library, 
the following method may be used. A cDNA library is generated using standard 
methods from a tissue known to contain NgR2. Such a tissue was identified in 
Example 2. 1 x 10 6 plaque forming units from the cDNA library may be screened in 
30 duplicate on OPTITRAN™ filters. The filters are hybridized with 32 P -labeled 

oligonucleotides that are generated from the ESTs corresponding to portions of NgR2. 
The hybridization reaction may consist of 400 mis plaque screen buffer (50mM Tris pH 
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7.5, lMNaCl, 0.1% Sodium pyrophosphate, 0.2% Polyvinylpryolidine and 0.2% 
Ficoll) containing 10% Dextran sulfate and lOOpg/ml tRNA and 80 pmol each 
32 P -labeled oligonucleotide at 65°C overnight. The filters are washed twice with 2X 
SSC/1%SDS and twice with IX SSC/1%SDS and exposed to film. Duplicate positives 
5 are purified. DNA from each of these clones is analyzed by restriction enzyme digest 
followed by agarose gel electrophoresis and Southern blotting. The filters are 
hybridized to the 32 P-labeled oligonucleotides used for the original hybridization to 
confirm that inserts hybridize to the probe. The insert is then sequenced to confirm that 
it represents the cDNA for NgR2. Similar methods may be used to generate a 
1 0 full-length clone corresponding to NgR3 . 

Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a 
person of ordinary skill in the art employing conventional PCR techniques. 

Example 9: Hybridization Analysis to demonstrate NgR expression in the brain 

1 5 The expression of NgR in mammals, such as the rat, may be investigated by in 

situ hybridization histochemistry. To investigate expression in the brain, for example, 
coronal and sagittal rat brain cryosections (20 pm thick) are prepared using a Reichert- 
Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free 
slides (CEL Associates, Inc., Houston, TX), and stored at -80°C. Sections are 
20 processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold 

phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine 
buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100% 
alcohol at room temperature. Subsequently, sections are delipidated in chloroform, 
followed by rehydration through successive exposure to 100% and 95% alcohol at 
25 room temperature. Microscope slides containing processed cryosections are allowed 
to air dry prior to hybridization. Other tissues may be assayed in a similar fashion. 

A NgR-specific probe may be generated using PCR. Following PCR 
amplification, the fragment is digested with restriction enzymes and cloned into 
pBluescript II cleaved with the same enzymes. For production of a probe specific for 
30 the sense strand of NgR, a cloned NgR fragment cloned in pBluescript II may be 
linearized with a suitable restriction enzyme, which provides a substrate for labeled 
run-off transcripts (i.e., cRNA riboprobes) using the vector-borne T7 promoter and 
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commercially available T7 RNA polymerase. A probe specific for the antisense strand 
of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving 
the recombinant plasmid with a suitable restriction enzyme to generate a linearized 
substrate for the production of labeled run-off cRNA transcripts using the T3 promoter 
5 and cognate polymerase. The riboprobes may be labeled with [ SJ-UTP to yield a 
specific activity of about 0.40 x 10 6 cpm/pmol for antisense riboprobes and about 0.65 
x 10 6 cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently 
denatured and added (2 pmol/ml) to hybridization buffer which contains 50% 
formamide, 10% dextran, 0.3 MNaCl, 10 mM Tris (pH 8.0), 1 mMEDTA, IX 
10 Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing 

sequential brain cryosections may be independently exposed to 45 pi of hybridization 
solution per slide and silanized cover slips may be placed over the sections being 
exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at 
52°C to allow hybridization to occur. Equivalent series of cryosections are then 
15 exposed to sense or antisense NgR-specific cRNA riboprobes. 

Following the hybridization period, coverslips are washed off the slides in IX 
SSC, followed by RNase A treatment involving the exposure of slides to 20 pg/ml 
RNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M 
NaCl for 45 minutes at 37°C. The cryosections are then subjected to three 
20 high-stringency washes in 0. 1 X SSC at 52°C for 20 minutes each. Following the 

series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, 
and 100% ammonium acetate in alcohol, followed by air drying and exposure to 
Kodak BioMax™ MR-1 film. After 13 days of exposure, the film is developed, and 
any significant hybridization signal is detected. Based on these results, slides containing 
25 tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB- 
2 nuclear track emulsion and the slides are stored in the dark for 32 days. The slides 
are then developed and counterstained with hematoxylin. Emulsion-coated sections 
are analyzed microscopically to determine the specificity of labeling. The signal is 
determined to be specific if autoradiographic grains (generated by antisense probe 
30 hybridization) are clearly associated with cresyl violate-stained cell bodies. 

Autoradiographic grains found between cell bodies indicate non-specific binding of the 
probe. 
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In some cases, such as using a probe to detect a NgR homolog in a 
heterologous species, in order to achieve optimal hybridization, it may be necessary to 
decrease the stringency conditions. Such conditions are well known to those of 
ordinary skill in the art and examples are provided above. 

5 Expression of NgR in the brain provides an indication that modulators of NgR 

activity have utility for treating neurological disorders. Some other diseases for which 
modulators of NgR may have utility include depression, anxiety, bipolar disease, 
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of NgR 
modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having 
10 such disease states is intended as an aspect of the invention. 

Example 10: Northern Blot Analysis of NgR-RNA with a PCR-generated Probe 

Northern blot hybridizations may be performed to examine the expression of 
NgR mRNA. A clone containing at least a portion of the sequence of SEQ ID NO: 1 
15 may be used as a probe. Vector-specific primers are used in PCR to generate a 
hybridization probe fragment for P-labeling. The PCR is performed as follows: 


20 


25 


Mix: 


1 pil NgR-containing plasmid 

2)j,l fwd primer (10-50 pM) 

2)0,1 rev primer (10-50 pM) 

lOpl lOxPCR buffer (such as that provided with the enzyme, 
Amersham Pharmacia Biotech) 

1 pi lOmM dNTP (such as #1 969 064 from Boehringer Mannheim) 

0. 5)0,1 Taq polymerase (such as #27-0799-62, Amersham Pharmacia 
Biotech) 

83.5)0,1 water 
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PCR is performed in a Thermocycler using the following program: 


94°C 5min 


5 94°C 

55°C 
72°C 


lmin 

lmin 30 cycles 
lmin 


72°C lOmin | 

10 

The PCR product may be purified using QIAquick PCR Purification Kit 
‘ (#28104) from Qiagen, and radictively labeled with 32 P-dCTP (#AA0005/250, 
Amersham Pharmacia Biotech)) may be done by random priming using "Ready-to-go 
DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech. 

1 5 Hybridization is carried out on Human Multiple Tissue Northern Blot from Clontech as 
described in manufacturer's protocol, or on a Northern Blot prepared by running RNA 
samples from cells of interest on an agarose gel and blotting to a membrane using any 
of the known Northern blotting protocols. After exposure overnight on Molecular 
Dynamics Phosphor Imager screen (#MD 146-8 14) bands of an appropriate size are 

20 visualized. 


Example 11: Recombinant Expression of NgR in Eukaryotic Host Cells 
A. Expression of NgR in Mammalian Cells 

To produce NgR protein, a NgR-encoding polynucleotide is expressed in a 
25 suitable host cell using a suitable expression vector and standard genetic engineering 
techniques. For example, a NgR-encoding sequence described in Table 4 is subcloned 
into the commercial expression vector pzeoSV2 (Invitrogen, San Diego, CA) and 
transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent 
FuGENE6™ (Boehringer-Mannheim) and the transfection protocol provided in the 
30 product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK 
293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by 
growth in the presence of 100 pg/ml zeocin (Stratagene, La Jolla, CA). As an 
alternative to FuGENE6™, the expression vector may carry the gene for dihydrofolate 
reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure 
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allows for stable transformation of CHO cells. Optionally, NgR may be purified from 
the cells using standard chromatographic techniques. To facilitate purification, antisera 
is raised against one or more synthetic peptide sequences that correspond to portions 
of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R. 

5 The NgR also may be expressed in-frame with a tag sequence (e.g . , polyhistidine, 
hemaglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that 
many of the uses for NgR polypeptides, such as assays described below, do not require 
purification of NgR from the host cell. 

10 B. Expression of NgR in CHO cells 

For expression of NgR in Chinese hamster ovary (CHO) cells, a plasmid 
bearing the relevant NgR coding sequence is prepared, using a vector which also bears 
the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into 
CHO cells. Selection under MTX drug pressure allows for preparation of stable 
15 transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a 

selectable marker such as DHFR. 

C. Expression of NgR in 293 cells 

For expression of NgR in mammalian cells 293 (transformed human, primary 
20 embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is 
prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the 
murine IgK chain leader sequence for secretion, the c-myc epitope for detection of the 
recombinant protein with the anti-myc antibody, a C-terminal polyhistidine for 
purification with nickel chelate chromatography, and a Zeocin resistant gene for 
25 selection of stable transfectants. The forward primer for amplification of this NgR 
cDNA is determined by routine procedures and preferably contains a 5' extension of 
nucleotides to introduce the Hindlll cloning site and nucleotides matching the NgR 
sequence. The reverse primer is also determined by routine procedures and preferably 
contains a 5' extension of nucleotides to introduce an Xhol restriction site for cloning 
30 and nucleotides corresponding to the reverse complement of the NgR sequence. The 
PCR conditions are 55°C as the annealing temperature. The PCR product is gel 
purified and cloned into the HindlTL-Xhol sites of the vector. 
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The DNA is purified using Qiagen chromatography columns and transfected 
into 293 cells using DOTAP™ transfection media (Boehringer Mannheim, 

Indianapolis, IN). Transiently transfected cells are tested for expression after 24 hours 
of transfection, using western blots probed with anti-His and anti-NgR peptide 
5 antibodies. Permanently transfected cells are selected with Zeocin and propagated. 
Production of the recombinant protein is detected from both cells and media by 
Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies. 

D. Transient Expression of Nogo-R in COS cells 

10 For expression of the NgR in COS7 cells, a polynucleotide molecule having a 

nucleotide sequence of SEQ ID NO: 1, for example, can be cloned into vector p3-CI. 
This vector is a pUC18-derived plasmid that contains the HCMV (human 
cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth 
hormone) polyadenylation sequence and a multiple cloning site. 

15 The forward primer is determined by routine procedures and preferably 

contains a 5' extension which introduces an Xbal restriction site for cloning, followed 
by nucleotides which correspond to a nucleotide sequence of SEQ ID NO: 1 . The 
reverse primer is also determined by routine procedures and preferably contains 5'- 
extension of nucleotides which introduces a Sal I cloning site followed by nucleotides 
20 which correspond to the reverse complement of a nucleotide sequence of SEQ ID 
NO:l. 

The PCR consists of an initial denaturation step of 5 min at 95 °C, 30 cycles of 
30 sec denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C, 
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into 
25 the Xbal and Sail sites of vector p3-CI. This construct is transformed into E. coli cells 
for amplification and DNA purification. The DNA is purified with Qiagen 
chromatography columns and transfected into COS 7 cells using Lipofectamine™ 
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours 
after transfection, the media and the cells are tested for recombinant protein 
30 expression. 

NgR expressed from a COS cell culture can be purified by concentrating the 
cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for 
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example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon 
concentrator fitted with a YM-10 membrane and stored at -80°C. NgR3 may also be 
expressed using this method and the nucleotide sequence of SEQ ID NO: 3 or SEQ ID 
NO:13. 

5 

E. Expression of NgR in Insect Cells 

For expression of NgR in a baculovirus system, a polynucleotide molecule 
having a nucleotide sequence of SEQ ID NO: 1, 3 or 13 can be amplified by PCR. The 
forward primer is determined by routine procedures and preferably contains a 5' 

10 extension which adds the Ndel cloning site, followed by nucleotides which correspond 
to a nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO: 3 or SEQ ID NO: 13, 
respectively). The reverse primer is also determined by routine procedures and 
preferably contains a 5' extension which introduces the Kpnl cloning site, followed by 
nucleotides which correspond to the reverse complement of a nucleotide sequence of 
15 SEQ ID NO:l (or SEQ ID NO:3 or SEQ ID NO:13, respectively). 

The PCR product is gel purified, digested with Ndel and Kpnl, and cloned into 
the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, CA). The 
pAcHTL expression vector contains the strong polyhedrin promoter of the 
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag 
20 upstream from the multiple cloning site. A protein kinase site for phosphorylation and 
a thrombin site for excision of the recombinant protein precede the multiple cloning 
site is also present. Of course, many other baculovirus vectors could be used in place 
of pAcHTL-A, such as pAc373, pVL941 and pAcIMl. Other suitable vectors for the 
expression of NgR polypeptides can be used, provided that the vector construct 
25 includes appropriately located signals for transcription, translation, and trafficking, 

such as an in-frame AUG and a signal peptide, as required. Such vectors are described 
inLuckow et al.. Virology 170:31-39, among others. 

The virus is grown and isolated using standard baculovirus expression methods, 
such as those described in Summers et al. (1987) A MANUAL OF METHODS FOR 
3 0 Baculovirus Vectors and Insect Cell Culture Procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555. 



WO 02/29059 


PCT/US01/31488 


- 121 - 

In a preferred embodiment, pAcHLT-A containing NgR gene is introduced into 
baculovirus using the "BaculoGold™" transfection kit (Pharmingen, San Diego, CA) 
using methods established by the manufacturer. Individual virus isolates are analyzed 
for protein production by radiolabeling infected cells with 35 S-methionine at 24 hours 
5 post infection. Infected cells are harvested at 48 hours post infection, and the labeled 
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can 
be isolated and used for scaled up expression. 

For expression of a NgR polypeptide in a Sf9 cells, a polynucleotide molecule 
having the nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO: 3 or SEQ ID 
10 NO: 13) can be amplified by PCR using the primers and methods described above for 
baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A 
(Pharmingen) for expression in Sf9 insect. The insert is cloned into the Nde I and Kpnl 
sites, after elimination of an internal Ndel site (using the same primers described above 
for expression in baculovirus). DNA is purified with Qiagen chromatography columns 
15 and expressed in Sf9 cells. Preliminary Western blot experiments from non-purified 
plaques are tested for the presence of the recombinant protein of the expected size 
which reacted with the NgR-specific antibody. These results are confirmed after 
further purification and expression optimization in HiG5 cells. 

20 F. Expression of soluble forms of NgR2 and NgR3 as NgR-Ig fusion 

proteins. 

To generate a NgR2-Ig fusion protein, standard methods may be used as 
described in the literature (e.g. Sanicola et al. (1997) Proc. Natl. Acad. Sci. USA. 94, 
6238-6243). For example, a DNA fragment encoding NgR2 without the sequence 
25 encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA 
fragment encoding the Fc domain of IgGl (which may be human IgGl), and the 
chimeric fragment may be cloned into an expression vector to generate a plasmid. The 
plasmid may then be transfected into Chinese hamster ovary cells to generate a stable 
cell line producing the fusion protein. The fusion protein is then purified from 
30 conditioned media using standard methods. For example, clarified conditioned media 
from the cell line may be loaded by gravity directly onto Protein A Sepharose. The 
column may then be washed with five column volumes each of PBS, PBS containing 



WO 02/29059 


PCT/US01/31488 


- 122 - 

0.5 MNaCl, and 25 mM sodium phosphate, 100 mMNaCl (pH 5.0). The bound 
protein may then be eluted with 25 mM NaH 2 P0 4 , 100 mM NaCl (pH 2.8) and 
immediately neutralized with 1/10 fraction volume of 0.5 M Na 2 HP0 4 (pH 8.6). 

Similar methods may be used to generate a NgR3-Ig fusion protein. 

5 

Example 12: Interaction Trap/Two-Hybrid System 

In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid 
library screening method can be used. This assay was first described in Fields et al 
(1989) Nature 340, 245, which is incorporated herein by reference in its entirety. A 
10 protocol is published in Current Protocols in Molecular Biology 1999, John 

Wiley & Sons, NY and Ausubel, F. M. et al 1992, Short Protocols IN Molecular 
Biology, fourth edition, Greene and Wiley-interscience, NY, which is incorporated 
herein by reference in its entirety. Kits are available from Clontech, Palo Alto, CA 
(Matchmaker Two-Hybrid System 3). 

15 A fusion of the nucleotide sequences encoding all or partial NgR and the yeast 

transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an 
appropriate plasmid (/'.<?., pGBKT7) using standard subcloning techniques. Similarly, a 
GAL4 active domain (AD) fusion library is constructed in a second plasmid (/'. e . , 
pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming 
20 cDNA libraries, see Sambrook et al 1989, MOLECULAR CLONING: A LABORATORY 

Manual, second edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY), which is incorporated herein by reference in its entirety. The DNA-BD/NgR 
fusion construct is verified by sequencing, and tested for autonomous reporter gene 
activation and cell toxicity, both of which would prevent a successful two-hybrid 
25 analysis. Similar controls are performed with the AD/library fusion construct to ensure 
expression in host cells and lack of transcriptional activity. Yeast cells are transformed 
(ca. 105 transformants/mg DNA) with both the NgR and library fusion plasmids 
according to standard procedure (Ausubel, et al , 1992, SHORT PROTOCOLS IN 
Molecular Biology, fourth edition, Greene and Wiley-interscience, NY, which is 
30 incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR 
with AD/library proteins results in transcription of specific yeast plasmid reporter 
genes (/. e . , lacZ, HIS3, ADE2, LEU2 ). Yeast cells are plated on nutrient-deficient 
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media to screen for expression of reporter genes. Colonies are dually assayed for 
p-galactosidase activity upon growth in Xgal 

(5-bromo-4-chloro-3-indolyl-b-D-galactoside) supplemented media (filter assay for 
b-galactosidase activity is described in Breeden et al., (1985) Cold Spring Harb. Symp. 
5 Quant. Biol., 50, 643, which is incorporated herein by reference in its entirety). 

Positive AD-library plasmids are rescued from transformants and reintroduced into the 
original yeast strain as well as other strains containing unrelated DNA-BD fusion 
proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced 
to verify the presence of an open reading frame fused to GAJL4 AD and to determine 
1 0 the identity of the NgR-binding protein. 

Example 13: Antibodies to Nogo-R 

Standard techniques are employed to generate polyclonal or monoclonal 
antibodies to the NgR receptor, and to generate useful antigen-binding fragments 
15 thereof or variants thereof, including "humanized" variants. Such protocols can be 
found, for example, in Sambrook et al. (1989), above, and Harlow et al. (Eds.), 
Antibodies A Laboratory manual; Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1988). In one embodiment, recombinant NgR polypeptides (or 
cells or cell membranes containing such polypeptides) are used as antigen to generate 
20 the antibodies. In another embodiment, one or more peptides having amino acid 

sequences corresponding to an immunogenic portion of NgR (e.g., 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides 
corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular 
portions, are preferred. The antigen may be mixed with an adjuvant or linked to a 
25 hapten to increase antibody production. 

A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, recombinant NgR or a synthetic fragment thereof 
is used to immunize a mouse for generation of monoclonal antibodies (or larger 
30 mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, 

peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the 
manufacturer's recommendations. For an initial injection, the antigen is emulsified with 
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Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to three 
weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete 
Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum 
sample is taken from the immunized mice and assayed by western blot to confirm the 
5 presence of antibodies that immunoreact with NgR. Serum from the immunized 

animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that 
recognize NgR. Alternatively, the mice are sacrificed and their spleen removed for 
generation of monoclonal antibodies. 

To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free 
10 RPMI 1640, and single cell suspensions are formed by grinding the spleens in serum- 
free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 
units/ml penicillin, and 100 pg/ml streptomycin (RPMI) (Gibco, Canada). The cell 
suspensions are filtered and washed by centrifugation and resuspended in serum-free 
RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar 
15 manner and used as a Feeder Layer. NS-1 myeloma cells, kept in log phase in RPMI 
with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three 
days prior to fusion, are centrifuged and washed as well. 

To produce hybridoma fusions, spleen cells from the immunized mice are 
combined with NS-1 cells and centrifuged, and the supernatant is aspirated. The cell 
20 pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in 75 mM 
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the 
addition of serum-free RPMI. Thereafter, the cells are centrifuged, resuspended in 
RPMI containing 15% FBS, 100 pM sodium hypoxanthine, 0.4 pM aminopterin, 16 
pM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 10 6 
25 thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates 
(Corning, Corning, NY). 

On days 2, 4, and 6 after the fusion, 100 pi of medium is removed from the 
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are 
screened by ELISA, testing for the presence of mouse IgG that binds to NgR. 

30 Selected fusion wells are further cloned by dilution until monoclonal cultures 
producing anti-NgR antibodies are obtained. 
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B. Humanization of anti-NgR monoclonal antibodies 

The expression pattern of NgR as reported herein and the potential ofNgRs as 
targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors 
(antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as 
5 NgR antagonists. Following are protocols to improve the utility of anti-NgR 

monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal 
antibodies to improve their serum half-life and render them less immunogenic in human 
hosts (i.e., to prevent human antibody response to non-human anti-NgR antibodies). 

The principles of humanization have been described in the literature and are 
10 facilitated by the modular arrangement of antibody proteins. To minimize the 
possibility of binding complement, a humanized antibody of the IgG4 isotype is 
preferred. 

For example, a level of humanization is achieved by generating chimeric 
antibodies comprising the variable domains of non-human antibody proteins of interest 
15 with the constant domains of human antibody molecules. (See, e.g., Morrison et al., 
(1989) Adv. Immunol., 44, 65-92). The variable domains of NgR-neutralizing anti- 
NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from 
cDNA generated from mRNA isolated from the hybridoma of interest. The V region 
gene fragments are linked to exons encoding human antibody constant domains, and 
20 the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or 
CHO cells). 

To achieve an even greater level of humanization, only those portions of the 
variable region gene fragments that encode antigen-binding complementarity 
determining regions ("CDR") of the non-human monoclonal antibody genes are cloned 
25 into human antibody sequences. (See, e.g., Jones et al., (1986) Nature 321, 522-525; 
Riechmann et al., (1988) Nature 332, 323-327; Verhoeyen et al., (1988) Science 239, 
1534-1536 ; and Tempest et al., (1991) Bio/Technology 9, 266-271). If necessary, the 
13-sheet framework of the human antibody surrounding the CDR3 regions also is 
modified to more closely mirror the three dimensional structure of the antigen-binding 
30 domain of the original monoclonal antibody. (See Kettleborough et al., (1991) Protein 
Engin. 4, 773-783; and Foote et al., (1992) J. Mol. Biol. 224, 487-499). 
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In an alternative approach, the surface of a non-human monoclonal antibody of 
interest is humanized by altering selected surface residues of the non-human antibody, 
e.g., by site-directed mutagenesis, while retaining all of the interior and contacting 
residues of the non-human antibody. See Padlan (1991) Mol. Immunol. 28, 489-498 . 

5 The foregoing approaches are employed using NgR-neutralizing anti-NgR 

monoclonal antibodies and the hybridomas that produce them to generate humanized 
NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions 
wherein NgR expression or ligand-mediated NgR signaling is detrimental. 

10 C. Human NgR-Neutralizing Antibodies from Phage Display 

Human NgR-neutralizing antibodies are generated by phage display techniques 
such as those described in Aujame etal. (1997) Human Antibodies 8, 155-168; 
Hoogenboom (1997) TIB TECH 15, 62-70; and Rader etal. (1997), Curr. Opin. 
Biotechnol. 8, 503-508, all of which are incorporated by reference. For example, 

15 antibody variable regions in the form of Fab fragments or linked single chain Fv 

fragments are fused to the amino terminus of filamentous phage minor coat protein 
pill. Expression of the fusion protein and incorporation thereof into the mature phage 
coat results in phage particles that present an antibody on their surface and contain the 
genetic material encoding the antibody. A phage library comprising such constructs is 
20 expressed in bacteria, and the library is screened for NgR-specific phage-antibodies 
using labeled or immobilized NgR as antigen-probe. 

D. Human NgR-neutralizing antibodies from transgenic mice 

Human NgR-neutralizing antibodies are generated in transgenic mice essentially 
25 as described in Bruggemann et al. (1996) Immunol. Today 17, 391-397 and 

Bruggemann etal. (1997) Curr. Opin. Biotechnol. 8, 455-458. Transgenic mice 
carrying human V-gene segments in germline configuration and that express these 
transgenes in their lymphoid tissue are immunized with a NgR composition using 
conventional immunization protocols, hybridomas are generated using B cells from the 
30 immunized mice using conventional protocols and screened to identify hybridomas 
secreting anti-NgR human antibodies {e.g., as described above). 
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Example 14: Assays to Identify Modulators of NgR Activity 

Set forth below are several nonlimiting assays for identifying modulators 
(agonists and antagonists) of NgR activity. Among the modulators that can be 
identified by these assays are natural ligand compounds of the receptor; synthetic 
5 analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or 
antibody-like compounds derived from natural antibodies or from antibody-like 
combinatorial libraries; and/or synthetic compounds identified by high-throughput 
screening of libraries; and the like. All modulators that bind NgR are useful for 
identifying NgR in tissue samples (e.g., for diagnostic purposes, pathological purposes, 
10 and the like). Agonist and antagonist modulators are useful for up-regulating and 
down-regulating NgR activity, respectively, to treat disease states characterized by 
abnormal levels of NgR activity. The assays may be performed using single putative 
modulators, and/or may be performed using a known agonist in combination with 
candidate antagonists (or visa versa). 

15 

A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are 
measured in NgR-transfected cells that have been exposed to candidate modulator 
compounds. Protocols for cAMP assays have been described in the literature. (See, 
20 e.g., Sutherland et al., (1968) Circulation 37, 279; Frandsen et al., (1976) Life 

Sciences 18, 529-541; Dooley et al., (1997) J. Pharmacol. Exp. Therap. 283, 735-41; 
and George et al., (1997) J. Biomol. Screening 2, 235-40). An exemplary protocol for 
such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN™ 
Life Science Products, is set forth below. 

25 Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA) 

is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and 
transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods, 
such as the transfection protocol provided by Boehringer-Mannheim when supplying 
the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well 
30 microplates from the FlashPlate® assay kit, which are coated with solid scintillant to 
which antisera to cAMP has been bound. For a control, some wells are seeded with 
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wild type (untransfected) CHO cells. Other wells in the plate receive various amounts 
of a cAMP standard solution for use in creating a standard curve. 

One or more test compounds (i. e . , candidate modulators) are added to the cells 
in each well, with water and/or compound-free medium/diluent serving as a control or 
5 controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 
minutes at room temperature. The assay is terminated by the addition of lysis buffer 
containing [ 125 I]-labeled cAMP, and the plate is counted using a Packard Topcount™ 
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or 
from standards) and fixed amounts of [ I]-cAMP compete for antibody bound to the 
10 plate. A standard curve is constructed, and cAMP values for the unknowns are 

obtained by interpolation. Changes in intracellular cAMP levels of cells in response to 
exposure to a test compound are indicative of NgR modulating activity. Modulators 
that act as agonists of receptors which couple to the G s subtype of G proteins will 
stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP 
15 levels. Agonists of receptors which couple to the Gi/o subtype of G proteins will inhibit 
forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP 
levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at 
receptors that are either constitutively active or activated by known agonists. 

20 B. Aequorin Assays 

In another assay, cells (e.g., CHO cells) are transiently co-transfected with both 
a NgR expression construct and a construct that encodes the photoprotein apoaquorin. 
In the presence of the cofactor coelenterazine, apoaquorin will emit a measurable 
luminescence that is proportional to the amount of intracellular (cytoplasmic) free 
25 calcium. (See generally, Cobbold, et al. "Aequorin measurements of cytoplasmic free 
calcium," In: McCormack J.G. and Cobbold P.H., eds., CELLULAR CALCIUM: A 
Practical Approach. Oxford:IRL Press (1991); Stables et al., (1997) Anal. 
Biochem. 252, 115-26; andHaugland, HANDBOOK OF FLUORESCENT PROBES AND 
Research Chemicals. Sixth edition. Molecular Probes, Eugene, OR (1996)). 

30 In one exemplary assay, NgR is subcloned into the commercial expression 

vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that 
encodes the photoprotein apoaquorin (Molecular Probes, Eugene, OR) into CHO cells 
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using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection 
protocol provided in the product insert. 

The cells are cultured for 24 hours at 37'C in MEM (Gibco/BRL, 

Gaithersburg, MD) supplemented with 10% fetal bovine serum, 2 raM glutamine, 10 
5 U/ml penicillin and 10 pg/ml streptomycin, at which time the medium is changed to 
serum-free MEM containing 5 pM coelenterazine (Molecular Probes, Eugene, OR). 
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are 
detached from the plate using VERSEN (Gibco/BRL), washed, and resuspended at 
200,000 cells/ml in serum-free MEM. 

10 Dilutions of candidate NgR modulator compounds are prepared in serum-free 

MEM and dispensed into wells of an opaque 96-well assay plate at 50 pl/well. Plates 
are then loaded onto an MLX microtiter plate luminometer (Dynex Technologies, Inc., 
Chantilly, VA). The instrument is programmed to dispense 50 pi cell suspensions into 
each well, one well at a time, and immediately read luminescence for 15 seconds. 

15 Dose-response curves for the candidate modulators are constructed using the area 
under the curve for each light signal peak. Data are analyzed with SlideWrite, using 
the equation for a one-site ligand, and EC 50 values are obtained. Changes in 
luminescence caused by the compounds are considered indicative of modulatory 
activity. Modulators that act as agonists at receptors which couple to the G q subtype 

20 of G proteins give an increase in luminescence of up to 100 fold. Modulators that act 

as inverse agonists will reverse this effect at receptors that are either constitutively 
active or activated by known agonists. 

C. Luciferase Reporter Gene Assay 

25 The photoprotein luciferase provides another useful tool for assaying for 

modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently co- 
transfected with both a NgR expression construct (e.g, NgR in pzeoSV2) and a 
reporter construct which includes a gene for the luciferase protein downstream from a 
transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or 

30 NF-kappa B. Expression levels of luciferase reflect the activation status of the 

signaling events. (See generally, George et al. (1997) J. Biomol. Screening 2, 235- 
240; and Stratowa et al. (1995) Cnrr. Opin. Biotechnol. 6, 574-581). Luciferase 
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activity may be quantitatively measured using, e.g., luciferase assay reagents that are 
commercially available from Promega (Madison, WI). 

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a 
density of 100,000 cells/well one day prior to transfection and cultured at 37°C in 
5 MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
U/ml penicillin and 10 pg/ml streptomycin. Cells are transiently co-transfected with 
both a NgR expression construct and a reporter construct containing the luciferase 
gene. The reporter plasmids CRE-luciferase, AP-1 -luciferase and NF-kappaB- 
luciferase may be purchased from Stratagene (Legally, CA). Transfections are 
10 performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) 

according to the supplier's instructions. Cells transfected with the reporter construct 
alone are used as a control. Twenty-four hours after transfection, cells are washed 
once with PBS pre-warmed to 37°C. Serum-free MEM is then added to the cells 
either alone (control) or with one or more candidate modulators and the cells are 
1 5 incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS 
and lysed by the addition of 100 gl of lysis buffer per well from the luciferase assay kit 
supplied by Promega. After incubation for 15 minutes at room temperature, 15 gl of 
the lysate is mixed with 50 gl of substrate solution (Promega) in an opaque-white, 
96-well plate, and the luminescence is read immediately on a Wallace model 1450 
20 MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg, 
MD). 

Differences in luminescence in the presence versus the absence of a candidate 
modulator compound are indicative of modulatory activity. Receptors that are either 
constitutively active or activated by agonists typically give a 3 -20-fold stimulation of 
25 luminescence compared to cells transfected with the reporter gene alone. Modulators 
that act as inverse agonists will reverse this effect. 

D. Intracellular calcium measurement using FLIPR 

Changes in intracellular calcium levels are another recognized indicator of 
30 receptor activity, and such assays can be employed to screen for modulators of NgR 
activity. For example, CHO cells stably transfected with a NgR expression vector are 
plated at a density of 4 x 10 4 cells/well in Packard black-walled, 96-well plates 
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specially designed to discriminate fluorescence signals emanating from the various 
wells on the plate. The cells are incubated for 60 minutes at 37°C in modified 
Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the 
addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3™ 

5 AM, Fluo-4™ AM, Calcium Green™-1 AM, or Oregon Green™ 488 BAPTA-1 AM), 
each at a concentration of 4 pM. Plates are washed once with modified D-PBS 
without 1% fetal bovine serum and incubated for 10 minutes at 37“C to remove 
residual dye from the cellular membrane. In addition, a series of washes with modified 
D-PBS without 1% fetal bovine serum is performed immediately prior to activation of 
10 the calcium response. 

A calcium response is initiated by the addition of one or more candidate 
receptor agonist compounds, calcium ionophore A23187 (10 pM; positive control), or 
ATP (4 pM; positive control). Fluorescence is measured by Molecular Device's 
FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al. 

15 (1998) Drug Dev. Res. 44,14-20). The F-stop for the detector camera is set at 2.5 and 

the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for 
20 seconds prior to addition of candidate agonist, ATP, or A23187, and the basal 
fluorescence level is subtracted from the response signal. The calcium signal is 
measured for approximately 200 seconds, taking readings every two seconds. Calcium 
20 ionophore A23 1 87 and ATP increase the calcium signal 200% above baseline levels. 

In general, activated NgRs increase the calcium signal at least about 10-15% above 
baseline signal. 

E. [ 35 S]GTP Y S Binding Assay 

25 It is also possible to evaluate whether NgR signals through a G 

protein-mediated pathway. Because G protein-coupled receptors signal through 
intracellular G proteins whose activity involves GTP binding and hydrolysis to yield 
bound GDP, measurement of binding of the non-hydrolyzable GTP analog [ 35 S]- 
GTPyS in the presence and absence of candidate modulators provides another assay 
30 for modulator activity. (See, e.g., Kowal et al., (1998) Neuropharmacology 37, 179- 
187.). 
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In one exemplary assay, cells stably transfected with a NgR expression vector 
are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of 
ice-cold Ca 2+ /Mg 2+ -fr ee phosphate-buffered saline, and scraped into 5 ml of the same 
buffer. Cells are pelleted by centrifugation (500 x g, 5 minutes), resuspended in TEE 
5 buffer (25 mM Tris, pH 7.5 , 5 mM EDTA, 5 mM EGTA), and frozen in liquid 

nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml 
TEE per plate of cells), and centrifuged at 1,000 x g for 5 minutes to remove nuclei 
and unbroken cells. 

The homogenate supernatant is centrifuged at 20,000 x g for 20 minutes to 
10 isolate the membrane fraction, and the membrane pellet is washed once with TEE and 
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl 2 , 
1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and 
stored at -70°C until use. 

Aliquots of cell membranes prepared as described above and stored at -70°C 
15 are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM 
MgCl 2 , 1 mM EDTA, 120 mM NaCl, 10 pM GDP, and 0.2 mM ascorbate, at a 
concentration of 10-50 pg/ml. In a final volume of 90 pi, homogenates are incubated 
with varying concentrations of candidate modulator compounds or 100 pM GTP for 
30 minutes at 30°C and then placed on ice. To each sample, 10 pi guanosine 
20 5'-0-(3[ 35 S]thio) triphosphate (NEN, 1200 Ci/mmol; [ 35 S]-GTPyS), was added to a 

final concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30 
minutes, 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl 2 , at 4°C is added and the 
reaction is stopped by filtration. 

Samples are filtered over Whatman GF/B filters and the filters are washed with 
25 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl 2 . Filters are counted by liquid 

scintillation spectroscopy. Nonspecific binding of [ S]-GTPyS is measured in the 
presence of 100 pM GTP and subtracted from the total. Compounds are selected that 
modulate the amount of [ 35 S]-GTPyS binding in the cells, compared to untransfected 
control cells. Activation of receptors by agonists gives up to a five-fold increase in 
30 [ 35 S]-GTPyS binding. This response is blocked by antagonists. 

F. [ 3 H]Arachidonic Acid Release 
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The activation of NgRs may also potentiate arachidonic acid release in cells, 
providing yet another useful assay for modulators ofNgR activity. (See, e.g., 
Kanterman et al . , (1991) Mol. Pharmacol. 3'9,364-369.) For example, CHO cells that 
are stably transfected with a NgR expression vector are plated in 24-well plates at a 
5 density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal 
bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 pg/ml streptomycin for 48 
hours at 37°C before use. Cells of each well are labeled by incubation with 
[ 3 H]-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 [xCi/ml in 1 ml MEM 
supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum 
10 albumin for 2 hours at 37°C. The cells are then washed twice with 1 ml of the same 
buffer. 

Candidate modulator compounds are added in 1 ml of the same buffer, either 
alone or with 10 pM ATP and the cells are incubated at 37°C for 30 minutes. Buffer 
alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each 
15 well are counted by liquid scintillation spectroscopy. Agonists which activate the 
receptor will lead to potentiation of the ATP-stimulated release of [ H]-arachidonic 
acid. This potentiation is blocked by antagonists. 

G. Extracellular Acidification Rate 

20 In yet another assay, the effects of candidate modulators ofNgR activity are 

assayed by monitoring extracellular changes in pH induced by the test compounds (see, 
e.g., Dunlop et al. (1998) J. Pharmacol. Toxicol. Meth. 40, 47-55). In one 
embodiment, CHO cells transfected with a NgR expression vector are seeded into 12 
mm capsule cups (Molecular Devices Corp.) at 4 x 10 5 cells/cup in MEM 
25 supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and 
10 pg/ml streptomycin. The cells are incubated in this medium at 37°C in 5% C0 2 for 
24 hours. 

Extracellular acidification rates are measured using a Cytosensor 
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the 
30 sensor chambers of the microphysiometer and the chambers are perfused with running 
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml 
penicillin, 10 ng/ml streptomycin, 26 mMNaCl) at a flow rate of 100 pl/minute. 
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Candidate agonists or other agents are diluted into the running buffer and perfused 
through a second fluid path. During each 60-second pump cycle, the pump is run for 
38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in 
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is 
5 re-started at 60 seconds to start the next cycle. The rate of acidification of the running 
buffer during the recording time is calculated by the Cytosoft program. Changes in the 
rate of acidification are calculated by subtracting the baseline value (the average of 4 
rate measurements immediately before addition of a modulator candidate) from the 
highest rate measurement obtained after addition of a modulator candidate. The 
10 selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the 
receptor result in an increase in the rate of extracellular acidification compared to the 
rate in the absence of agonist. This response is blocked by modulators which act as 
antagonists of the receptor. 

15 Example 15: mNgR3 does not bind hNogo-A(1055-1120) 

To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to 
bind hNogo-A(1055-l 120), a cDNA expression vector for a myc epitope-tagged 
mNgR3 protein was created. The mouse NgR3 cDNA was amplified by PCR from 
mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into 
20 the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc 
tag followed by the mature mNgR3 sequence. This plasmid was transfected into 
COS07cells, and expression of a myc-tagged protein of the predicted size was verified 
by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-1120) binding studies 
and myc immunohistology were conducted as described (Fournier et al., supra). 

25 The cells expressing mNgR3 express the myc-tagged protein but binding to 

AP-hNogo-A(1055-1120) was not observed under the conditions employed (Fig. 8). 

Example 16: Identification of partial human NgR3 cDNA and protein sequences 

The tblastn program was used to search for the human homolog of mouse 
30 NgR3 . The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a 

proprietary human expressed sequence tag (EST) database from Incyte yielding one 
highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides) 
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contains an open reading frame of 312 amino acids in the second reverse frame that 
exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4), 
strongly indicating that it is part of the human NgR3 homolog. 

A query of SEQ ID NO: 4 against the public human EST database in Genbank 
5 also produced a hit with a 465-bp EST (Accession number: R35699; Version number: 
R35699.1; GI: 792600). There are a number of single nucleotide deletions and 
insertions within this sequence which cause frame shift errors. All of the reliable 
sequence contained in this public EST is present in the Incyte EST (Template ID 
190989.1). 

10 To obtain more nucleotide sequence that would extend the amino acid 

sequence at that carboxy terminal end, the I.M.A.G.E. Consortium clone No. 38319, 
which corresponds to Genbank accession No. R35699, was purchased from Incyte 
Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of 
a Notl/HinD III fragment containing the sequence of interest, cloned into the 
15 Notl/HinD III sites of the vector Lafmid BA 

(http://image.llnl.gov/image/html/libs/lafmidBA.shtml). The clone was received as an 
agar stab, which was streaked out on LB agar plates containing 50ug/ml ampicillin to 
isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and 
plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA 
20 Purification System (Promega #A7500). These DNAs were subsequently digested 
with Notl and HinD III restriction enzymes to confirm that the clones contained an 
insert. The insert of one isolate was sequenced using a combination of vector specific 
and gene specific primers yielding a partial nucleotide sequence of human NgR3 of 
1 176 nucleotides (SEQ ID NO: 13). A translation of this sequence provides a partial 
25 sequence for human NgR3 of 392 amino acids (SEQ ID NO: 14). 

The nucleotide sequence of SEQ ID NO: 13 differs from the Incyte EST 
sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO: 13 are CG, 
whereas the corresponding nucleotides in the Incyte Template ID 190989.1 are GT 
(i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In 
30 addition, position 641 in SEQ ID NO: 13 is a C, whereas the corresponding nucleotide 
in the Incyte Template ID 190989.1 sequence is an A (i.e., position 641 of the 
complement of Incyte Template ID 190989.1). This results in two changes in amino 
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acids when comparing SEQ ID NO: 14 to the ORF encoded by Incyte Template 
190989. 1 : SEQ ID NO: 14 contains a valine at position 5, whereas the ORF encoded 
by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO: 14 contains an 
alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989.1 
5 contains a glutamic acid. 

The nucleotide sequence of SEQ ID NO: 13 differs from the public EST 
(Accession number: R35699; Version number: R35699. 1; GI: 792600) sequence at 
two positions (within the first 200 nucleotides of reliable sequence). Nucleotide 
positions 12-13 in SEQ ID NO: 13 are CG, whereas the corresponding nucleotides in 
10 the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R35699; 
Version no: R35699.1; GI: 792600) This leads to a single amino acid change when 
comparing SEQ ID NO: 14 to the ORF encoded by the public EST: SEQ ID NO: 14 
contains a valine at position 5, while the ORF encoded by the public EST contains a 
leucine. 

15 A Bestfit analysis of the partial human amino acid sequence with the full-length 

mouse amino acid sequence indicates that the human NgR3 amino acid sequence is 
complete at the carboxy terminal end and that they share 89.54% identity. An 
alignment of all the NgR proteins is shown in Figure 9. Although the human NgR3 
amino acid sequence is missing the first 25 amino acids, it can be determined that the 
20 human NgR3 protein contains the following features in common with the other NgR 
sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy- 
terminal (LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a 
conserved potential glycosylation site in the eighth LRR domain; and (5) a 
hydrophobic carboxyl terminus. 

25 As those skilled in the art will appreciate, numerous changes and modifications 

may be made to the preferred embodiments of the invention without departing from the 
spirit of the invention. It is intended that all such variations fall within the scope of the 
invention. 

The entire disclosure of each publication cited herein is hereby incorporated by 
30 reference. This application claims benefit from United States provisional application 
60/238,361, filed October 6, 2000, which is incorporated by reference herein in its 
entirety. 
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Key for Sequence Listing: 

SEQ ID NO: 1 human NgR2 cDNA sequence derived from genomic sequence 

AC013606 

SEQ ID NO:2 human NgR2 amino acid sequence 

5 SEQ ID NO:3 mouse NgR3 cDNA sequence derived from AC021768 

SEQ ID NO:4 a mouse NgR3 amino acid sequence 

SEQ ID NO: 5 a human NgRl amino acid sequence 

SEQ ID NO: 6 a consensus amino acid sequence for NgRs 

SEQ ID NO:7 #1055-1 120 amino acid residues of hNogoA (Nogo-66) 

10 SEQ ID NO: 8 a mature human NgR2 amino acid sequence 

SEQ ID NO: 9 a mature mouse NgR3 amino acid sequence 

SEQ ID NO: 10 a consensus NgR LLRNT amino acid sequence 

SEQ ID NO: 11 a consensus NgR LRRCT domain amino acid sequence 

SEQ ID NO: 12 a consensus NgR LRR domain amino acid sequence 

15 SEQ ID NO: 13 a partial human NgR3 nucleotide sequence 

SEQ ID NO: 14 a partial human NgR3 amino acid sequence 

SEQ ID NO: 15 a genomic sequence encoding a human NgR2 sequence. 

SEQ ID NO: 16 a genomic sequence (complementary strand) encoding a mouse 

NgR3 

20 SEQ ID NO: 17 a mouse NgRl amino acid sequence 

SEQ ID NO: 18 a consensus sequence for the NTLRRCT domain of NgR 

SEQ ID NO: 19 an consensus NgR LRRCT domain amino acid sequence 


30 
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What is claimed is: 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding 
a polypeptide comprising an LRRCT domain consisting of the amino acid sequence: 


NX, wx 2 cx 3 crarx 4 lwx 5 w x 6 x 7 x 8 x 9 r x 10 s s sx„ V 


10 


X 12 C X 13 X 14 P X 15 x 16 


X 17 X 18 X 19 X 20 D L X 21 X 22 L X 23 X 24 X 25 D 


X 26 X 27 X 28 C [SEQ ID NO: 19] 

15 wherein X is any amino acid or a gap and the polypeptide does not comprise 

the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NgRl) or 
SEQ ID NO: 17 (mouse NgRl). 


2. The isolated nucleic acid according to claim 1, wherein X 17 and X 23 
20 are independently selected from the group consisting of: arginine and lysine. 

3. The isolated nucleic acid according to claim 2, wherein the amino 
acid sequence of the LRRCT domain is selected from the group consisting of: residues 
#261-310 of SEQ ID NO:2 and residues 261-310 of SEQ ID NO: 2 with up to 10 

25 conservative amino acid substitutions. 


4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2. 

5. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 4 
30 (mouse NgR3) or SEQ ID NO: 14 (human NgR3). 


6. The isolated nucleic acid according to claim 1, wherein the 
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polypeptide comprises: (a) a NTLRRCT domain, and (b) less than a complete CTS 
domain, provided that a partial CTS domain, if present, consists of no more than the 
first 39 amino acids of the CTS domain. 


5 7. The isolated nucleic acid to claim 1, wherein the polypeptide does 

not comprise an intact GPI domain. 

8. An isolated nucleic acid consisting essentially of a nucleotide 
sequence complementary to a nucleotide sequence encoding a polypeptide selected 
10 from the group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID 
NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a 
polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid 
is from 8 to 100 nucleotides in length. 

15 9. A vector comprising the nucleic acid of any one of claims 1, 4 or 5. 

10. A host cell comprising a vector according to claim 9. 


20 


11. A polypeptide comprising a LRRCT amino acid sequence: 

N Xj W X 2 C X 3 C R A*R X 4 L W X 5 W X 6 X 7 X 8 X 9 R X 10 S S S X„ V 


25 


X 12 C X„ X, d P X„ X 1fi X, 7 X,„ X 10 X™ D L X,, X„ L X,, X„ X„ D 


H3 vv 14 


^13 


‘-le - /v 17 ^-18 ^19 


■20 


V 21 ^22 ■ 


^23 ^24 yv 25 


X 26 X 27 X 28 C [SEQ ID NO: 19] 


wherein X is any amino acid residue or a gap and the polypeptide does not 
comprise the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
(human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

12. The polypeptide according to claim 1 1, wherein X 17 and X 23 is 
selected from the group consisting of arginine and lysine. 
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13. The polypeptide according to claim 1 1, wherein X 19 is glycine. 
[SEQ ID NO: 11] 

14. The polypeptide according to claim 1 1, wherein the amino acid 

5 sequence is selected from the group consisting of residues 261—310 of SEQ ID NO:2, 
residues 206-255 of SEQ ID NO: 14, residues 271-320 of SEQ ID NO:4 and amino 
acid sequences thereof comprising a conservative substitution. 


10 


15 


20 


25 


15. A polypeptide comprising a NTLRRCT amino acid sequence: 


C P XjX 2 C X 3 C Y X 4 X 5 P X 6 X 7 T X 8 S C X 9 X 10 X u X 12 X 13 X 14 X 15 X 16 P 
x„ X 18 X 19 P X 20 X 21 X 22 X 23 R X 24 F L X 25 X 26 N X 27 I X 28 X 29 X 30 X 31 X 32 X 33 
X 34 F X 3S X 36 X 37 X 38 X 39 X 40 X 41 X 42 L W X 43 X 44 S N X 45 X 46 X 47 X 48 1 X 49 
X 50 X 51 X 52 F X 53 X 54 X 55 X 56 X 57 L E X 58 L D L X 59 D N X 60 X 61 L X 62 X 63 X 64 
X 65 P X 66 T F X 67 G L X 6g X 69 L X 70 X 71 L X 72 L X 73 X 74 C X 75 L X 76 X 77 L X 78 
X 79 X 80 X 81 F X 82 G L X 83 X 84 L Q Y L Y L Q X 85 N X 86 X 87 X 88 X 89 L X 90 D 
X 91 X 92 F X 93 D L X 94 N L X 95 H L F L H GN X 96 X 97 X 98 X 99 X 100 X 101 X 102 
X 103 X 104 F R G L X 105 X 106 LDRLLLH X 107 N X 108 X 109 X 110 X U1 V H X 112 
X U3 A F X 114 X 115 L X 116 R L X 117 X 118 L X 119 L F X 120 N X 121 L X 122 X 123 L 
x i24 X 125 X 126 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 L R L N X 134 N X 135 W 
X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
vx 147 c Xi 4 s x i49 P X 150 X 151 X I52 X 153 X 154 X 15J D L X J56 X 157 L Xi 58 X 159 X lfi0 
D X 161 X 162 X 163 C [SEQ ID NO:18] 


wherein X is any amino acid residue or a gap and wherein the polypeptide is 
not the polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 
(mouse NgRl). 


16. The polypeptide according to claim 15, wherein X 6 , X 37 and X 38 
30 represents a gap. 


17. A polypeptide comprising an amino sequence selected from the 
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group consisting of: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO: 14. 

18. The polypeptide according any one of claims 11, 15 or 17, wherein 
the polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete 

5 CTS domain, provided that a partial CTS domain, if present, consists of no more than 
the first 39 amino acids of the CTS domain. 

19. The polypeptide according to any one of claims 11, 15 or 17, 
wherein the polypeptide does not comprise an intact GPI domain. 

10 

20. The polypeptide according to any one of claims 11, 15 or 17, 
wherein the amino acid sequence of the polypeptide further comprises an amino acid 
sequence of a heterologous polypeptide. 

15 21. The polypeptide according to claim 20, wherein the heterologous 

polypeptide is an Fc portion of an antibody. 

22. A method of producing a polypeptide according to any one of 
claims 11, 15 or 17, comprising the steps of introducing an isolated nucleic acid 

20 according to any one of claims 1, 4, 5 or 8 or a vector according to claim 9 into a host 
cell, culturing said host cell under conditions suitable for expression of said 
polypeptide, and recovering said polypeptide. 

23. An antibody that binds to a polypeptide of any one of claims 11,15 

25 or 17. 

24. A composition comprising the polypeptide of claim 1 1, 15 or 17 
and a pharmaceutically acceptable carrier. 

30 25. A composition comprising the antibody of claim 23 and a 

pharmaceutically acceptable carrier. 
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26. A method of decreasing inhibition of axonal growth of a CNS 
neuron, comprising the step of contacting the neuron with an effective amount of the 
polypeptide of claim 11, 15 or 17. 

5 27. A method of treating a central nervous system disease, disorder or 

injury, comprising administering to a mammal an effective amount of the polypeptide 
of claim 11, 15 or 17. 

28. A method of decreasing inhibition of axonal growth of a CNS 
10 neuron comprising the step of contacting the neuron with an effective amount of the 

antibody according to claim 23 . 

29. A method of treating a central nervous system disease, disorder or 
injury, comprising administering to a mammal an effective amount of the antibody 

1 5 according to claim 23 . 

30. A method for identifying a molecule that binds a polypeptide of 
15 or 17 comprising the steps of: 

(a) providing a polypeptide of claim 11, 15 or 17; 

(b) contacting the polypeptide with the candidate molecule; 

(c) detecting binding of the candidate molecule to the 
polypeptide. 

25 
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SEQUENCE LISTING 


<110> BIOGEN, INC. 

YALE UNIVERSITY 
STRITTMATTER, STEPHEN M. 

CATE, RICHARD L. 

SAH, DINAH W.Y. 

<12 0> NOGO RECEPTOR HOMOLOGS 

<130> A116PCT 

<140> 

<14 1> 

<150> 60/238,361 
<151> 2000-10-06 

<160> 16 

<170> Patentln Ver. 2.1 

<210> 17 
<211> 1260 
<212> DNA 
<213> Homo sapiens 

<400> 1 

atgctgcccg ggctcaggcg cctgctgcaa gctcccgcct cggcctgcct cctgctgatg 60 

ctcctggccc tgcccctggc ggcccccagc tgccccatgc tctgcacctg ctactcatcc 120 

ccgcccaccg tgagctgcca ggccaacaac ttctcctctg tgccgctgtc cctgccaccc 180 

agcactcagc gactcttcct gcagaacaac ctcatccgca cgctgcggcc aggcaccttt 240 

gggtccaacc tgctcaccct gtggctcttc tccaacaacc tctccaccat ctacccgggc 300 

actttccgcc acttgcaagc cctggaggag ctggacctcg gtgacaaccg gcacctgcgc 360 

tcgctggagc ccgacacctt ccagggcctg gagcggctgc agtcgctgca tttgtaccgc 420 

tgccagctca gcagcctgcc cggcaacatc ttccgaggcc tggtcagcct gcagtacctc 480 

tacctccagg agaacagcct gctccaccta caggatgact tgttcgcgga cctggccaac 540 

ctgagccacc tcttcctcca cgggaaccgc ctgcggctgc tcacagagca cgtgtttcgc 600 

ggcctgggca gcctggaccg gctgctgctg cacgggaacc ggctgcaggg cgtgcaccgc 660 

gcggccttcc gcggcctcag ccgcctcacc atcctctacc tgttcaacaa cagcctggcc 720 

tcgctgcccg gcgaggcgct cgccgacctg ccctcgctcg agttcctgcg gctcaacgct 780 

aacccctggg cgtgcgactg ccgcgcgcgg ccgctctggg cctggttcca gcgcgcgcgc 840 

gtgtccagct ccgacgtgac ctgcgccacc ccccc ggagc gccagggccg agacctgcgc 900 

gcgctccgcg aggccgactt ccaggcgtgt ccgcccgcgg cacccacgcg gccgggcagc 960 

cgcgcccgcg gcaacagctc ctccaaccac ctgtacgggg tggccgaggc cggggcgccc 1020 

ccagccgatc cctccaccct ctaccgagat ctgcctgccg aagactcgcg ggggcgccag 1080 

ggcggggacg cgcctactga ggacgactac tgggggggct acgggggtga ggaccagcga 1140 

ggggagcaga tgtgccccgg cgctgcctgc caggcgcccc cggactcccg aggccctgcg 1200 

ctctcggccg ggctccccag ccctctgctt tgcctcctgc tcctggtgcc ccaccacctc 1260 


< 210 > 2 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 2 


Met 

1 

Leu 

Pro 

Gly 

Leu 

5 

Arg 

Arg 

Leu 

Leu 

Gin 

10 

Ala 

Pro 

Ala 

Ser 

Ala 

15 

Cys 

Leu 

Leu 

Leu 

Met 

20 

Leu 

Leu 

Ala 

Leu 

Pro 

25 

Leu 

Ala 

Ala 

Pro 

Ser 

30 

Cys 

Pro 
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Met 

Leu 

Cys 

35 

Thr 

Cys 

Tyr 

Ser 

Ser 

40 

Pro 

Pro 

Thr 

Val 

Ser 

45 

Cys 

Gin 

Ala 

Asn 

Asn 

50 

Phe 

Ser 

Ser 

Val 

Pro 

55 

Leu 

Ser 

Leu 

Pro 

Pro 

60 

Ser 

Thr 

Gin 

Arg 

Leu 

65 

Phe 

Leu 

Gin 

Asn 

Asn 

70 

Leu 

lie 

Arg 

Thr 

Leu 

75 

Arg 

Pro 

Gly 

Thr 

Phe 

80 

Gly 

Ser 

Asn 

Leu 

Leu 

85 

Thr 

Leu 

Trp 

Leu 

Phe 

90 

Ser 

Asn 

Asn 

Leu 

Ser 

95 

Thr 

lie 

Tyr 

Pro 

Gly 

100 

Thr 

Phe 

Arg 

His 

Leu 

105 

Gin 

Ala 

Leu 

Glu 

Glu 

110 

Leu 

Asp 

Leu 

Gly 

Asp 

115 

Asn 

Arg 

His 

Leu 

Arg 

120 

Ser 

Leu 

Glu 

Pro 

Asp 

125 

Thr 

Phe 

Gin 

Gly 

Leu 

130 

Glu 

Arg 

Leu 

Gin 

Ser 

135 

Leu 

His 

Leu 

Tyr 

Arg 

140 

Cys 

Gin 

Leu 

Ser 

Ser 

145 

Leu 

Pro 

Gly Asn 

lie 

150 

Phe 

Arg 

Gly 

Leu 

Val 

155 

Ser 

Leu 

Gin 

Tyr 

Leu 

160 

Tyr 

Leu 

Gin 

Glu 

Asn 

165 

Ser 

Leu 

Leu 

His 

Leu 

170 

Gin 

Asp 

Asp 

Leu 

Phe 

175 

Ala 

Asp 

Leu 

Ala 

Asn 

180 

Leu 

Ser 

His 

Leu 

Phe 

185 

Leu 

His 

Gly 

Asn 

Arg 

190 

Leu 

Arg 

Leu 

Leu 

Thr 

195 

Glu 

His 

Val 

Phe 

Arg 

200 

Gly 

Leu 

Gly 

Ser 

Leu 

205 

Asp 

Arg 

Leu 

Leu 

Leu 

210 

His 

Gly Asn Arg 

Leu 

215 

Gin 

Gly 

Val 

His 

Arg 

220 

Ala 

Ala 

Phe 

Arg 

Gly 

225 

Leu 

Ser 

Arg 

Leu 

Thr 

230 

He 

Leu 

Tyr 

Leu 

Phe 

235 

Asn 

Asn 

Ser 

Leu 

Ala 

240 

Ser 

Leu 

Pro 

Gly 

Glu 

245 

Ala 

Leu 

Ala 

Asp 

Leu 

250 

Pro 

Ser 

Leu 

Glu 

Phe 

255 

Leu 

Arg 

Leu 

Asn 

Ala 

260 

Asn 

Pro 

Trp 

Ala 

Cys 

265 

Asp 

Cys 

Arg 

Ala 

Arg 

270 

Pro 

Leu 

Trp 

Ala 

Trp 

275 

Phe 

Gin Arg 

Ala 

Arg 

280 

Val 

Ser 

Ser 

Ser 

Asp 

285 

Val 

Thr 

Cys 

Ala 

Thr 

290 

Pro 

Pro 

Glu 

Arg 

Gin 

295 

Gly 

Arg 

Asp 

Leu 

Arg 

300 

Ala 

Leu 

Arg 

Glu 

Ala 

305 

Asp 

Phe 

Gin 

Ala 

Cys 

310 

Pro 

Pro 

Ala 

Ala 

Pro 

315 

Thr 

Arg 

Pro 

Gly 

Ser 

320 

Arg 

Ala 

Arg 

Gly 

Asn 

325 

Ser 

Ser 

Ser 

Asn 

His 

330 

Leu 

Tyr 

Gly 

Val 

Ala 

335 

Glu 

Ala 

Gly 

Ala 

Pro 

340 

Pro 

Ala 

Asp 

Pro 

Ser 

345 

Thr 

Leu 

Tyr 

Arg 

Asp 

350 

Leu 

Pro 

Ala 

Glu 

Asp 

355 

Ser 

Arg 

Gly 

Arg 

Gin 

360 

Gly 

Gly Asp 

Ala 

Pro 

365 

Thr 

Glu 

Asp 

Asp 

Tyr 

Trp 

Gly 

Gly 

Tyr 

Gly 

Gly 

Glu 

Asp 

Gin 

Arg 

Gly 

Glu 

Gin 

Met 
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370 375 380 

Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly Pro Ala 

385 390 395 400 

Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu Val 

405 410 415 

Pro His His Leu 
420 


<210> 3 
<211> 1383 
<212> DNA 
<213> Mus sp. 

<400> 3 

atgtcttggc agtctggaac cacagtgaca caatctcccg tgcaggctgc tcaggtctca 60 

gggtgctgtg tggaattgct gctgttgctg ctcgctggag agctacctct gggtggtggt 120 

tgtcctcgag actgtgtgtg ctaccctgcg cccatgactg tcagctgcca ggcacacaac 180 

tttgctgcca tcccggaggg catcccagag gacagtgagc gcatcttcct gcagaacaat 240 

cgcatcacct tcctccagca gggccacttc agccccgcca tggtcaccct ctggatctac 300 

tccaacaaca tcactttcat tgctcccaac accttcgagg gctttgtgca tctggaggag 360 

ctagaccttg gagacaaccg acagctgcga acgctggcac ccgagacctt ccaaggcctg 420 

gtgaagcttc acgccctcta cctctataag tgtggactga gcgccctgcc cgcaggcatc 480 

tttggtggcc tgcacagcct gcagtatctc tacttgcagg * acaaccatat cgagtacctc 540 
caagatgaca tctttgtgga cctggtcaat ctcagtcaci: tgtttctcca tggtaacaag 600 

ctatggagcc tgggccaagg catcttccgg ggcctggtga acctggaccg gttgctgctg 660 

catgagaacc agctacagtg ggttcaccac aaggctttcc atgacctcca caggctaacc 720 

accctctttc tcttcaacaa cagcctcact gagctgcagg gtgactgtct ggcccccctg 780 

gtggccttgg agttccttcg cctcaatggg aatgcttggg actgtggctg ccgggcacgt 840 

tccctgtggg aatggctgcg aaggttccgt ggctctagct ctgctgtccc ctgcgcgacc 900 

cccgagctgc ggcaaggcca ggatctgaag ctgctgaggg tggaggactt ccggaactgc 960 

acaggaccag tgtctcctca ccagatcaag tctcacacgc ttaccacctc tgacagggct 1020 

gcccgcaagg agcaccatcc gtcccatggg gcctccaggg acaaaggcca cccacatggc 1080 

catccgcctg gctccaggtc aggttacaag aaggcaggca agaactgcac cagccacagg 1140 

aaccggaacc agatctctaa ggtgagctct gggaaagagc ttaccgaact gcaggactat 1200 

gcccccgact atcagcacaa gttcagcttt gacatcatgc ccaccgcacg acccaagagg 1260 

aagggoaagt gtgctcgcag gacccccatc cgtgccccca gtggggtgca gcaggcatcc 1320 

tcaggcacgg cccttggggc cccactcctg gcctggatac tggggctggc agtcactctc 1380 

cgc 1383 


<210> 4 
<211> 461 
<212> PRT 
<213> Mus sp . 

<400> 4 

Met Ser Trp Gin Ser Gly Thr Thr Val Thr Gin Ser Pro Val Gin Ala 
15 10 15 

Ala Gin Val Ser Gly Cys Cys Val Glu Leu Leu Leu Leu Leu Leu Ala 
20 25 30 

Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr 
35 40 45 

Pro Ala Pro Met Thr Val Ser Cys Gin Ala His Asn Phe Ala Ala lie 
50 55 60 

Pro Glu Gly lie Pro Glu Asp Ser Glu Arg lie Phe Leu Gin Asn Asn 
65 70 75 80 
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Arg lie Thr Phe Leu Gin Gin Gly His 
85 

Leu Trp lie Tyr Ser Asn Asn lie Thr 
100 105 

Glu Gly Phe Val His Leu Glu Glu Leu 
115 120 

Leu Arg Thr Leu Ala Pro Glu Thr Phe 
130 135 

Ala Leu Tyr Leu Tyr Lys Cys Gly Leu 
145 150 

Phe Gly Gly Leu His Ser Leu Gin Tyr 
165 

lie Glu Tyr Leu Gin Asp Asp lie Phe 
180 185 

His Leu Phe Leu His Gly Asn Lys Leu 
195 200 

Phe Arg Gly Leu Val Asn Leu Asp Arg 
210 215 

Leu Gin Trp Val His His Lys Ala Phe 
225 230 

Thr Leu Phe Leu Phe Asn Asn Ser Leu 
245 

Leu Ala Pro Leu Val Ala Leu Glu Phe 
260 265 

Trp Asp Cys Gly Cys Arg Ala Arg Ser 
275 280 

Phe Arg Gly Ser Ser Ser Ala Val Pro 
290 295 

Gin Gly Gin Asp Leu Lys Leu Leu Arg 
305 310 

Thr Gly Pro Val Ser Pro His Gin lie 
325 

Ser Asp Arg Ala Ala Arg Lys Glu His 
340 345 

Arg Asp Lys Gly His Pro His Gly His 
355 360 

Tyr Lys Lys Ala Gly Lys Asn Cys Thr 
370 375 

lie Ser Lys Val Ser Ser Gly Lys Glu 
385 390 

Ala Pro Asp Tyr Gin His Lys Phe Ser 
4 05 

Arg Pro Lys Arg Lys Gly Lys Cys Ala 


PCT/US01/31488 


Phe Ser Pro Ala Met Val Thr 
90 95 

Phe lie Ala Pro Asn Thr Phe 
110 

Asp Leu Gly Asp Asn Arg Gin 
125 

Gin Gly Leu Val Lys Leu His 
140 

Ser Ala Leu Pro Ala Gly lie 
155 160 

Leu Tyr Leu Gin Asp Asn His 
170 175 

Val Asp Leu Val Asn Leu Ser 
190 

Trp Ser Leu Gly Gin Gly lie 
205 

Leu Leu Leu His Glu Asn Gin 
220 

His Asp Leu His Arg Leu Thr 
235 ' 240 

Thr Glu Leu Gin Gly Asp Cys 
250 255 

Leu Arg Leu Asn Gly Asn Ala 
270 

Leu Trp Glu Trp Leu Arg Arg 
285 

Cys Ala Thr Pro Glu Leu Arg 
300 

Val Glu Asp Phe Arg Asn Cys 
315 320 

Lys Ser His Thr Leu Thr Thr 
330 335 

His Pro Ser His Gly Ala Ser 
350 

Pro Pro Gly Ser Arg Ser Gly 
365 

Ser His Arg Asn Arg Asn Gin 
380 

Leu Thr Glu Leu Gin Asp Tyr 
395 400 

Phe Asp lie Met Pro Thr Ala 
410 415 

Arg Arg Thr Pro lie Arg Ala 
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420 





Pro 

Ser 

Gly 

435 

Val 

Gin 

Gin 

Ala 

Ser 

440 

Leu 

Leu 

Ala 

Trp 

lie 

Leu 

Gly 

Leu 


450 455 


<210> 5 

<211> 473 

<2 12> PRT 

<213> Homo sapiens 

<400> 5 

Met Lys Arg Ala Ser Ala Gly Gly 
1 5 

Trp Leu Gin Ala Trp Gin Val Ala 
2 0 

Cys Tyr Asn Glu Pro Lys Val Thr 
35 40 

Gin Ala Val Pro Val Gly lie Pro 
50 55 

His Gly Asn Arg lie Ser His Val 
65 70 

Arg Asn Leu Thr lie Leu Trp Leu 
85 

Asp Ala Ala Ala Phe Thr Gly Leu 
100 

Ser Asp Asn Ala Gin Leu Arg Ser 
115 120 

Leu Gly Arg Leu His Thr Leu His 
130 135 

Leu Gly Pro Gly Leu Phe Arg Gly 
145 150 

Leu Gin Asp Asn Ala Leu Gin Ala 
165 

Leu Gly Asn Leu Thr His Leu Phe 
180 

Val Pro Glu Arg Ala Phe Arg Gly 
195 200 

Leu His Gin Asn Arg Val Ala His 
210 215 

Leu Gly Arg Leu Met Thr Leu Tyr 
225 230 

Leu Pro Thr Glu Ala Leu Ala Pro 
245 

Leu Asn Asp Asn Pro Trp Val Cys 


425 430 

Ser Gly Thr Ala Leu Gly Ala Pro 
445 

Ala Val Thr Leu Arg 
460 


Ser Arg Leu Leu Ala Trp Val Leu 
10 15 

Ala Pro Cys Pro Gly Ala Cys Val 
25 30 

Thr Ser Cys Pro Gin Gin Gly Leu 
45 

Ala Ala Ser Gin Arg lie Phe Leu 
60 

Pro Ala Ala Ser Phe Arg Ala Cys 
75' 80 

His Ser Asn Val Leu Ala Arg lie 
90 95 

Ala Leu Leu Glu Gin Leu Asp Leu 
105 110 

Val Asp Pro Ala Thr Phe His Gly 
125 

Leu Asp Arg Cys Gly Leu Gin Glu 
140 

Leu Ala Ala Leu Gin Tyr Leu Tyr 

155 " 160 

Leu Pro Asp Asp Thr Phe Arg Asp 

170 175 

Leu His Gly Asn Arg lie Ser Ser 
185 190 

Leu His Ser Leu Asp Arg Leu Leu 
205 

Val His Pro His Ala Phe Arg Asp 
220 

Leu Phe Ala Asn Asn Leu Ser Ala 

235 240 

Leu Arg Ala Leu Gin Tyr Leu Arg 

250 255 

Asp Cys Arg Ala Arg Pro Leu Trp 
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260 265 270 


Ala 

Trp 

Leu 

275 

Gin 

Lys 

Phe 

Arg 

Gly 

280 

Ser 

Ser 

Ser 

Glu 

Val 

285 

Pro 

Cys 

Ser 

Leu 

Pro 

290 

Gin 

Arg 

Leu 

Ala 

Gly Arg 
295 

Asp 

Leu 

Lys 

Arg 

300 

Leu 

Ala 

Ala 

Asn 

Asp 

305 

Leu 

Gin 

Gly 

Cys 

Ala 

310 

Val 

Ala 

Thr 

Gly 

Pro 

315 

Tyr 

His 

Pro 

lie 

Trp 

320 

Thr 

Gly 

Arg 

Ala 

Thr 

325 

Asp 

Glu 

Glu 

Pro 

Leu 

330 

Gly 

Leu 

Pro 

Lys 

Cys 

335 

Cys 

Gin 

Pro 

Asp 

Ala 

340 

Ala 

Asp 

Lys 

Ala 

Ser 

345 

Val 

Leu 

Glu 

Pro 

Gly Arg 
350 

Pro 

Ala 

Ser 

Ala 

355 

Gly 

Asn 

Ala 

Leu 

Lys 

360 

Gly Arg 

Val 

Pro 

Pro 

365 

Gly Asp 

Ser 

Pro 

Pro 

370 

Gly 

Asn 

Gly 

Ser 

Gly 

375 

Pro 

Arg 

His 

lie 

Asn Asp 
380 

Ser 

Pro 

Phe 

Gly 

385 

Thr 

Leu 

Pro 

Gly 

Ser 

390 

Ala 

Glu 

Pro 

Pro 

Leu 

395 

Thr 

Ala 

Val 

Arg 

Pro 

400 

Glu 

Gly 

Ser 

Glu 

Pro 

405 

Pro 

Gly 

Phe 

Pro 

Thr 

410 

Ser 

Gly 

Pro 

Arg Arg 
415 

Arg 

Pro 

Gly 

Cys 

Ser 

420 

Arg 

Lys 

Asn Arg 

Thr 

425 

Arg 

Ser 

His 

Cys 

Arg 

430 

Leu 

Gly 

Gin 

Ala 

Gly 

435 

Ser 

Gly 

Gly 

Gly 

Gly 

440 

Thr 

Gly 

Asp 

Ser 

Glu 

445 

Gly 

Ser 

Gly 

Ala 

Leu 

450 

Pro 

Ser 

Leu 

Thr 

Cys 

455 

Ser 

Leu 

Thr 

Pro 

Leu 

460 

Gly 

Leu 

Ala 

Leu 

Val 

465 

Leu 

Trp 

Thr 

Val 

Leu 

470 

Gly 

Pro 

Cys 









<210> 6 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<22 0 > 

<223> Description of Artificial Sequence: Consensus 
sequence 

< 220 > 

<22 1> MOD_RES 
<222> (3) . . (4) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD__RES 
< 222 > ( 6 ) 

<223> Variable amino acid 
<22 0 > 

<22 1> MODJRES 
<222> (9) . . (10) 
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<223> Variable amino acid 
<22 0 > 

<22 1> MOD_RES 
<222> (12) . . (13) 

<223> Variable amino acid 

<22 0 > 

<221> MOD__RES 
<222> (15) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD__RES 
<222> (18) . . (25) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD_RES 
<222> (27) . . (29) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (31) . . (34) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD_RES 
<222> (36) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (39) . . (40) 

<223> Variable amino acid 

<220> 

<22 1> MOD_RES 
<222> (42) 

<223> Variable amino acid 
<220> 

<22 1> MOD_RES 
<222> (44) . . (50) 

<223> Variable amino acid 

<22 0> 

<22 1> MODJRES 
<222> (52) . . (59) 

<223> Variable amino acid 

<220> 

<22 1> MODJRES 
<222> (62) . . (63) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD_RES 
<222> (66) . . (69) 

<223> Variable amino acid 

<22 0 > 

<221> MOD RES 
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<222> 

(71) . . (74) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(76) . . (80) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(83) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(87) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(90) . . (91) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(94) . . (96) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(98) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(101) 


<223> 

Variable amino 

acid 

<220> 

<22 1> 

MOD RES 


<222> 

(104) . . (105) 


<223> 

Variable amino 

acid 

<22 0> 

<22 1> 

MOD RES 


<222> 

(107) 


<223> 

Variable amino 

acid 

<220> 

<22 1> 

MOD RES 


<222> 

(109) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(111) . . (112) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(114) 


<223> 
<22 0> 

Variable amino 

acid 
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<22 1> MOD_RES 
<222> (116) . . (117) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD_RES 
<222> (119) . . (122) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD_RES 
<222> (124) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (127) . . (128) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (136) 

<223> Variable amino acid 
<22 0 > 

<22 1> MODJRES 
<222> (138) . . (141) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (143) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (146) 

<223> Variable amino acid 
<22 0 > 

<221> MOD_RES 
<222> (148) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (151) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (154) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD__RES 
<222> (162) . . (170) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD RES 

<222> (175) . . (176) 

<223> Variable amino acid 
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< 220 > 

<221> MOD_RES 
<222> (184) 

<223> Variable amino acid 
< 220 > 

<221> M0D__RES 
<222 > (186) . . (189) 

<223> Variable amino acid 

< 220 > 

<22 1> MODJRES 
<222> (192) . ■ (193) 

<223> Variable amino acid 

<22 0 > 

<221> MOD__RES 
<222> (196) . . (197) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD_RES 
<222> (199) 

<223> Variable amino acid 
< 220 > 

<22 1> MODJRES 
<222> (202) . . (203) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (205) 

<223> Variable amino acid 
<22 0> 

<22 1> MOD_RES 
<222> (208) 

<223> Variable amino acid 
<220> 

<22 1> MOD_RES 
< 222 > ( 210 ) 

<223> Variable amino acid 
<22 0 > 

<22 1> MOD_RES 
<222> (212) . . (213) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (215) . . (218) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
< 222 > ( 221 ) 

<223> Variable amino acid 


<22 0 > 

<22 1> MODJRES 
<222> (223) . . (224) 

<223> Variable amino acid 


10 



WO 02/29059 


PCT/US01/31488 


<220> 

<22 1> 

MOD RES 


<222> 

(226) . . (227) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(232) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(234) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(236) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(238) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(243) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(246) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(248) . . (251) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(253) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(257) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(259) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(261) . . (262) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(264) . . (267) 


<223> 

Variable amino 

acid 
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< 220 > 

<22 1> MOD_RES 
<222> (269) 

<223> Variable amino acid 
<22 0 > 

<22 1> MOD_RES 
<222> (272) . . (273) 

<223> Variable amino acid 

<22 0 > 

<22 1> MODJRES 
<222> (275) . . (277) 

<223> Variable amino acid 

< 220 > 

<221> MOD_RES 
<222> (279) . . (281) 

<223> Variable amino acid 

< 220 > 

<22 1> MOD__RES 
<222> (283) . . (287) 

<223> Variable amino acid 

<22 0 > 

<221> MOD_RES 
<222> (289) . . (290) 

<223> Variable amino acid 

<22 0 > 

<221> MOD__RES 
<222> (292) . . (328) 

<223> Variable amino acid 

<22 0> 

<221> MOD_RES 
<222> (330) . . (341) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (344).. (346) 

<223> Variable amino acid 

<22 0> 

<22 1> MOD_RES 
<222> (348) . . (399) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (401) . . (428) 

<223> Variable amino acid 

<22 0 > 

<22 1> MODJRES 
<222> (431) . . (439) 

<223> Variable amino acid 

<400> 6 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
15 10 15 
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Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 

Xaa Xaa Arg Xaa Phe Leu Xaa Xaa Asn 
35 40 

Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 

Asn Xaa Xaa Xaa Xaa lie Xaa Xaa Xaa 
65 70 

Leu Glu Xaa Leu Asp Leu Xaa Asp Asn 
85 

Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa 
100 105 

Cys Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa 
115 120 

Leu Gin Tyr Leu Tyr Leu Gin Xaa Asn 
130 135 

Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu 
145 150 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
165 

Leu Asp Arg Leu Leu Leu His Xaa Asn 
180 185 

Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu 
195 200 

Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa 
210 215 

Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn 
225 230 

Ala Arg Xaa Leu Trp Xaa Trp Xaa Xaa 
245 

Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa 
260 265 

Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa 
275 280 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly 
325 

Xaa Xaa Xaa Xaa Xaa Pro Pro Xaa Xaa 
340 345 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 


PCT/US01/31488 


Pro Xaa Xaa Xaa Pro Xaa Xaa 
30 


Xaa lie Xaa Xaa Xaa Xaa Xaa 
45 

Xaa Xaa Leu Trp Xaa Xaa Ser 
60 

Xaa Phe Xaa Xaa Xaa Xaa Xaa 
75 80 

Xaa Xaa Leu Arg Xaa Xaa Xaa 
90 95 

Leu Xaa Leu Xaa Leu Xaa Xaa 
110 

Xaa Phe Xaa Gly Leu Xaa Xaa 
125 

Xaa Xaa Xaa Xaa Leu Xaa Asp 
140 

Xaa His Leu Phe Leu His Gly 
155 160 

Xaa Phe Arg Gly Leu Xaa Xaa 
170 175 

Xaa Xaa Xaa Xaa Val His Xaa 
190 

Xaa Xaa Leu Xaa Leu Phe Xaa 
205 

Xaa Leu Ala Xaa Leu Xaa Xaa 
220 

Xaa Trp Xaa Cys Xaa Cys Arg 

235 240 

Xaa Xaa Arg Xaa Ser Ser Ser 

250 " 255 

Xaa Xaa Gly Xaa Asp Leu Xaa 

270 

Cys Xaa Xaa Xaa Xaa Xaa Pro 
285 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

330 335 

Xaa Ser Xaa Xaa Xaa Xaa Xaa 
350 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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355 





360 





365 




Xaa 

Xaa 

370 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

375 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

380 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

385 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

390 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

395 

Xaa 

Xaa 

Xaa 

Xaa 

Arg 

400 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

405 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

410 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

415 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

420 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

425 

Xaa 

Xaa 

Xaa 

Leu 

Xaa 

430 

Xaa 

Xaa 

Xaa 

Xaa 

Xaa 

435 

Xaa 

Xaa 

Xaa 

Xaa 

Leu 

440 










<2 10> 7 
< 211 > 66 
<212> PRT 
<213> Homo sapiens 

<400> 7 


Arg 

1 

He 

Tyr 

Lys 

Gly 

5 

Val 

lie 

Gin 

Ala 

lie 

10 

Gin 

Lys 

Ser 

Asp 

Glu 

15 

Gly 

His 

Pro 

Phe 

Arg 

20 

Ala 

Tyr 

Leu 

Glu 

Ser 

25 

Glu 

Val 

Ala 

lie 

Ser 

30 

Glu 

Glu 

Leu 

Val 

Gin 

35 

Lys 

Tyr 

Ser 

Asn 

Ser 

40 

Ala 

Leu 

Gly 

His 

Val 

45 

Asn 

Cys 

Thr 

lie 

Leu 

Lys 

50 

Lys 

Glu 

Leu 

Arg 

Arg 

Leu 

55 

Phe 

Leu 

Val 

Asp 

Asp 

60 

Leu 

Val 

Asp 

Ser 


65 


<210> 8 

<211> 390 

<212> PRT 

<2 13> Homo sapeins 

< 4 00 > 8 


Cys 

1 

Pro 

Met 

Leu 

Cys 

5 

Thr 

Cys 

Tyr 

Ser 

Ser 

10 

Pro 

Pro 

Thr 

Val 

Ser 

15 

Cys 

Gin 

Ala 

Asn 

Asn 

20 

Phe 

Ser 

Ser 

Val 

Pro 

25 

Leu 

Ser 

Leu 

Pro 

Pro 

30 

Ser 

Thr 

Gin 

Arg 

Leu 

35 

Phe 

Leu 

Gin 

Asn 

Asn 

40 

Leu 

lie 

Arg 

Thr 

Leu 

45 

Arg 

Pro 

Gly 

Thr 

Phe 

50 

Gly 

Ser 

Asn 

Leu 

Leu 

55 

Thr 

Leu 

Trp 

Leu 

Phe 

60 

Ser 

Asn 

Asn 

Leu 

Ser 

65 

Thr 

lie 

Tyr 

Pro 

Gly 

70 

Thr 

Phe 

Arg 

His 

Leu 

75 

Gin 

Ala 

Leu 

Glu 

Glu 

80 

Leu 

Asp 

Leu 

Gly 

Asp 

85 

Asn 

Arg 

His 

Leu 

Arg 

90 

Ser 

Leu 

Glu 

Pro 

Asp 

95 

Thr 
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Phe 

Gin 

Gly 

Leu 

100 

Glu 

Arg 

Leu 

Gin 

Ser 

105 

Leu 

His 

Leu 

Tyr 

Arg 

110 

Cys 

Gin 

Leu 

Ser 

Ser 

115 

Leu 

Pro 

Gly 

Asn 

lie 

120 

Phe 

Arg 

Gly 

Leu 

Val 

125 

Ser 

Leu 

Gin 

Tyr 

Leu 

130 

Tyr 

Leu 

Gin 

Glu 

Asn 

135 

Ser 

Leu 

Leu 

His 

Leu 

140 

Gin 

Asp 

Asp 

Leu 

Phe 

145 

Ala 

Asp 

Leu 

Ala 

Asn 

150 

Leu 

Ser 

His 

Leu 

Phe 

155 

Leu 

His 

Sly 

Asn 

Arg 

160 

Leu 

Arg 

Leu 

Leu 

Thr 

165 

Glu 

His 

Val 

Phe 

Arg 

170 

Gly 

Leu 

Gly 

Ser 

Leu 

175 

Asp 

Arg 

Leu 

Leu 

Leu 

180 

His 

Gly 

Asn 

Arg 

Leu 

185 

Gin 

Gly 

Val 

His 

Arg 

190 

Ala 

Ala 

Phe 

Arg 

Gly 

195 

Leu 

Ser 

Arg 

Leu 

Thr 

200 

lie 

Leu 

Tyr 

Leu 

Phe 

205 

Asn 

Asn 

Ser 

Leu 

Ala 

210 

Ser 

Leu 

Pro 

Gly 

Glu 

215 

Ala 

Leu 

Ala 

Asp 

Leu 

220 

Pro 

Ser 

Leu 

Glu 

Phe 

225 

Leu 

Arg 

Leu 

Asn 

Ala 

230 

Asn 

Pro 

Trp 

Ala 

Cys 

235 

Asp 

Cys 

Arg 

Ala 

Arg 

240 

Pro 

Leu 

Trp 

Ala 

Trp 

245 

Phe 

Gin 

Arg 

Ala 

Arg 

250 

Val' 

Ser 

Ser 

Ser 

Asp 

255 

Val 

Thr 

Cys 

Ala 

Thr 

260 

Pro 

Pro 

Glu 

Arg 

Gin 

265 

Gly 

Arg 

Asp 

Leu 

Arg 

270 

Ala 

Leu 

Arg 

Glu 

Ala 

275 

Asp 

Phe 

Gin 

Ala 

Cys 

280 

Pro 

Pro 

Ala 

Ala 

Pro 

285 

Thr 

Arg 

Pro 

Gly 

Ser 

290 

Arg 

Ala 

Arg 

Gly 

Asn 

295 

Ser 

Ser 

Ser 

Asn 

His 

300 

Leu 

Tyr 

Gly Val 

Ala 

305 

Glu 

Ala 

Gly 

Ala 

Pro 

310 

Pro 

Ala 

Asp 

Pro 

Ser 

315 

Thr 

Leu 

Tyr 

Arg 

Asp 

320 

Leu 

Pro 

Ala 

Glu 

Asp 

325 

Ser 

Arg 

Gly 

Arg 

Gin 

330 

Gly 

Gly Asp 

Ala 

Pro 

335 

Thr 

Glu 

Asp 

Asp 

Tyr 

340 

Trp 

Gly 

Gly 

Tyr 

Gly 

345 

Gly 

Glu 

Asp 

Gin 

Arg 

350 

Gly 

Glu 

Gin 

Met 

Cys 

355 

Pro 

Gly 

Ala 

Ala 

Cys 

360 

Gin 

Ala 

Pro 

Pro 

Asp 

365 

Ser 

Arg 

Gly 

Pro 

Ala 

370 

Leu 

Ser 

Ala 

Gly 

Leu 

375 

Pro 

Ser 

Pro 

Leu 

Leu 

380 

Cys 

Leu 

Leu 

Leu 

Leu 

385 

Val 

Pro 

His 

His 

Leu 

390 












<210> 9 
<211> 421 
<212> PRT 
<2 13> Mus sp . 

<400> 9 
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Cys 

1 

Pro 

Arg 

Asp 

Cys 

5 

Val 

Cys 

Tyr 

Pro 

Ala 

10 

Pro 

Met 

Thr 

Val 

Ser 

15 

Cys 

Gin 

Ala 

His 

Asn 

20 

Phe 

Ala 

Ala 

lie 

Pro 

25 

Glu 

Gly 

lie 

Pro 

Glu 

30 

Asp 

Ser 

Glu 

Arg 

lie 

35 

Phe 

Leu 

Gin 

Asn 

Asn 

40 

Arg 

lie 

Thr 

Phe 

Leu 

45 

Gin 

Gin 

Gly 

His 

Phe 

50 

Ser 

Pro 

Ala 

Met 

Val 

55 

Thr 

Leu 

Trp 

lie 

Tyr 

60 

Ser 

Asn 

Asn 

lie 

Thr 

65 

Phe 

lie 

Ala 

Pro 

Asn 

70 

Thr 

Phe 

Glu 

Gly 

Phe 

75 

Val 

His 

Leu 

Glu 

Glu 

80 

Leu 

Asp 

Leu 

Gly 

Asp 

85 

Asn 

Arg 

Gin 

Leu 

Arg 

90 

Thr 

Leu 

Ala 

Pro 

Glu 

95 

Thr 

Phe 

Gin 

Gly 

Leu 

100 

Val 

Lys 

Leu 

His 

Ala 

105 

Leu 

Tyr 

Leu 

Tyr 

Lys 

110 

Cys 

Gly 

Leu 

Ser 

Ala 

115 

Leu 

Pro 

Ala 

Gly 

lie 

120 

Phe 

Gly 

Gly 

Leu 

His 

125 

Ser 

Leu 

Gin 

Tyr 

Leu 

130 

Tyr 

Leu 

Gin 

Asp 

Asn 

135 

His 

lie 

Glu 

Tyr 

Leu 

140 

Gin 

Asp 

Asp 

lie 

Phe 

145 

Val 

Asp 

Leu 

Val 

Asn 

150 

Leu 

Ser 

His 

Leu 

Phe 

155* 

Leu 

His 

Gly 

Asn 

Lys 

160 

Leu 

Trp 

Ser 

Leu 

Gly 

165 

Gin 

Gly 

lie 

Phe 

Arg 

170 

Gly 

Leu 

Val 

Asn 

Leu 

175 

Asp 

Arg 

Leu 

Leu 

Leu 

180 

His 

Glu 

Asn 

Gin 

Leu 

185 

Gin 

Trp 

Val 

His 

His 

190 

Lys 

Ala 

Phe 

His 

Asp 

195 

Leu 

His 

Arg 

Leu 

Thr 

200 

Thr 

Leu 

Phe 

Leu 

Phe 

205 

Asn 

Asn 

Ser 

Leu 

Thr 

210 

Glu 

Leu 

Gin 

Gly 

Asp 

215 

Cys 

Leu 

Ala 

Pro 

Leu 

220 

Val 

Ala 

Leu 

Glu 

Phe 

225 

Leu 

Arg 

Leu 

Asn 

Gly 

230 

Asn 

Ala 

Trp 

Asp 

Cys 

235 

Gly 

Cys 

Arg 

Ala 

Arg 

240 

Ser 

Leu 

Trp 

Glu 

Trp 

245 

Leu 

Arg 

Arg 

Phe 

Arg 

250 

Gly 

Ser 

Ser 

Ser 

Ala 

255 

Val 

Pro 

Cys 

Ala 

Thr 

260 

Pro 

Glu 

Leu 

Arg 

Gin 

265 

Gly 

Gin 

Asp 

Leu 

Lys 

270 

Leu 

Leu 

Arg 

Val 

Glu 

275 

Asp 

Phe 

Arg 

Asn 

Cys 

280 

Thr 

Gly 

Pro 

Val 

Ser 

285 

Pro 

His 

Gin 

He 

Lys 

290 

Ser 

His 

Thr 

Leu 

Thr 

295 

Thr 

Ser 

Asp 

Arg 

Ala 

300 

Ala 

Arg 

Lys 

Glu 

His 

305 

His 

Pro 

Ser 

His 

Gly 

310 

Ala 

Ser 

Arg 

Asp 

Lys 

315 

Gly 

His 

Pro 

His 

Gly 

320 

His 

Pro 

Pro 

Gly 

Ser 

325 

Arg 

Ser 

Gly 

Tyr 

Lys 

330 

Lys 

Ala 

Gly 

Lys 

Asn 

335 

Cys 

Thr 

Ser 

His 

Arg 

Asn 

Arg 

Asn 

Gin 

lie 

Ser 

Lys 

Val 

Ser 

Ser 

Gly 

Lys 
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340 

Glu Leu Thr Glu Leu 
355 

Ser Phe Asp lie Met 
370 

Ala Arg Arg Thr Pro 
385 

Ser Gly Thr Ala Leu 
405 

Ala Val Thr Leu Arg 
420 


345 

Gin Asp Tyr Ala Pro Asp 
360 

Pro Thr Ala Arg Pro Lys 
375 

lie Arg Ala Pro Ser Gly 
390 395 

Gly Ala Pro Leu Leu Ala 
410 


350 

Tyr Gin His Lys Phe 
365 

Arg Lys Gly Lys Cys 
380 

Val Gin Gin Ala Ser 
400 

Trp lie Leu Gly Leu 
415 


<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<22 0 > 

<223> Description of Artificial Sequence: Consensus 
sequence 

< 220 > 

<22 1> M0D_RES 
<222> (3) . . (4) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
< 222 > ( 6 ) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (9) . . (10) 

<223> Variable amino acid 

< 22 0> 

<22 1> MOD_RES 
<222> (12) . . (13) 

<223> Variable amino acid 

< 220 > 

<22 1> M0D_RES 
<222> (15) 

<223> Variable amino acid 
<4 00> 10 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
15 10 15 

Cys 


< 21 0 > 11 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
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<22 0 > 

<223> Description of Artificial Sequence: Consensus 

sequence 

< 220 > 

<22 1> MOD_RES 
< 222 > ( 2 ) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (4) 

<223> Variable amino acid 
<22 0 > 

<22 1> MODJRES 
< 222 > ( 6 ) 

<223> Variable amino acid 
< 220 > 

<221> MOD_RES 
< 222 > ( 11 ) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (14) 

<223> Variable amino acid 
<22 0 > 

<221> MOD___RES 
<222> (16) . . (19) 

<22 3> Variable amino acid 

< 220 > 

<221> MOD_RES 
< 222 > ( 21 ) 

<223> Variable amino acid 
<22 0> 

<22 1> MOD_RES 
<222> (25) 

<223> Variable amino acid 
<22 0 > 

<22 1> MOD__RES 
<222> (27) 

<223> Variable amino acid 
< 220 > 

<22 1> MOD_RES 
<222> (29) . . (30) 

<223> Variable amino acid 

< 220 > 

<221> MOD_RES 
<222> (32) . . (35) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (37) 

<223> Variable amino acid 
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<22 0 > 

<22 1> MOD_RES 
<222> (40) . . (41) 

<223> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
<222> (43) . . (45) 

<223> Variable amino acid 

<22 0 > 

<221> MOD_RES 
<222> (47) . . (49) 

<223> Variable amino acid 

<400> 11 


Asn 

1 

Xaa 

Trp 

Xaa 

Cys 

5 

Xaa 

Cys 

Arg 

Ala 

Arg 

10 

Xaa 

Leu 

Trp 

Xaa 

Trp 

15 

Xaa 

Xaa 

Xaa 

Xaa 

Arg 

20 

Xaa 

Ser 

Ser 

Ser 

Xaa 

25 

Val 

Xaa 

Cys 

Xaa 

Xaa 

30 

Pro 

Xaa 

Xaa 

Xaa 

Xaa 

35 

Gly 

Xaa 

Asp 

Leu 

Xaa 

40 

Xaa 

Leu 

Xaa 

Xaa 

Xaa 

45 

Asp 

Xaa 

Xaa 


Xaa Cys 
50 


<210> 12 
<211> 196 
<212> PRT 

<213> Artificial Sequence 
<22 0 > 

<223> Description of Artificial Sequence: Consensus 

sequence 

<22 0 > 

<22 1> MOD_RES 
< 222 > ( 2 ) 

<223> Variable amino acid 
<22 0 > 

<22 1> MOD_RES 
<222> (5) . . (6) 

<22 3> Variable amino acid 

<22 0 > 

<22 1> MOD_RES 
< 222 > ( 8 ) 

<223> Variable amino acid 
<22 0 > 

<22 1> MODJRES 
<222> (10) . . (16) 

<223> Variable amino acid 

< 220 > 

<221> MOD_RES 
<222> (18) . . (25) 

<223> Variable amino acid 

< 220 > 
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<221> 

MOD RES 


<222> 

(28) . . (29) 


<223> 

Variable amino 

acid 

<22 0> 

<22 1> 

MOD RES 


<222> 

(32) . . (35) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(37) . . (40) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(42) . . (46) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(49) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(53) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(56) . . (57) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(60) . . (62) 


<223> 

Variable amino 

acid 

<22 0> 

<22 1> 

MOD RES 


<222> 

(64) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(67) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(70) . . (71) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(73) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(75) 


<223> 

Variable amino 

acid 
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<22 0> 

<221> 

MOD RES 


<222> 

(77) . . (78) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(80) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(82) . . (83) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(85) . . (88) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(90) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(93) . . (94) 


<223> 

Variable amino 

acid 

<22 0> 

<22 1> 

MOD RES 


<222> 

(102) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(104) . . (107) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(109) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(112) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(114) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(117) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(120) 


<223> 

Variable amino 

acid 
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<22 0> 

<221> 

MOD RES 


<222> 

(128) . . (136) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(141) . . (142) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(150) 


<223> 

Variable amino 

acid 

<220> 

<22 1> 

MOD RES 


<222> 

(152) . . (155) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(158) . . (159) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(162) . . (163) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(165) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(168) . . (169) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(171) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(174) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(176) 


<223> 

Variable amino 

acid 

<220> 

<221> 

MOD RES 


<222> 

(178) . . (179) 


<223> 

Variable amino 

acid 

<22 0> 

<221> 

MOD RES 


<222> 

(181) . . (184) 


<223> 

Variable amino 

acid 
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< 220 > 

<22 1> MOD_RES 
<222> (187) 

<223> Variable amino acid 
< 220 > 

<221> MOD_RES 
<222> (189) . . (190) 

<223> Variable amino acid 

< 220 > 

<221> MOD_RES 
<222> (192) . . (193) 

<223> Variable amino acid 

<400> 12 

Arg Xaa Phe Leu Xaa Xaa Asn Xaa lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa 
20 25 30 

Xaa Xaa Xaa lie Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Glu 
35 40 45 

Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa 
50 55 60 

Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa Cys Xaa 
65 70 75 80 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa Leu Gin 
85 90 95 

Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa 
100 105 110 

Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly Asn Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu Asp 
130 135 140 

Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala 
145 150 155 160 

Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa Asn Xaa 
165 170 175 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa 
180 185 190 

Xaa Leu Arg Leu 
195 


<210> 13 

<211> 1176 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gagggcatcc ccgtggacag cgagcgcgtc ttcctgcaga acaaccgcat cggcctcctc 60 
cagcccggcc acttcagccc cgccatggtc accctgtgga tctactcgaa caacatcacc 120 
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tacatccacc ccagcacctt cgagggcttc gtgcacctgg aggagctgga cctcggcgac 180 

aaccggcagc tgcggacgct ggcacccgag accttccagg gcctggtgaa gcttcacgcc 240 

ctctacctct acaagtgtgg gctcagcgcc ttgccggccg gcgtctttgg cggcctgca c 300 

agcctgcagt acctctacct gcaggacaac cacatcgagt acctccagga cgacatcttc 360 

gtggacctgg tcaacctcag ccacctgttt ctccacggca acaagctgtg gagtctgggc 420 

ccgggcacct tccggggcct ggtgaacctg gaccgtcttt tgctgcacga gaaccagctg 480 

cagtgggtcc accacaaggc attccacgac ctccgcaggc tgaccaccct cttcctcttc 540 

aacaacagcc tctcggagct gcagggtgag tgcctggccc cgctgggggc cctggagttc 600 

ctccgcctca acggcaaccc ctgggactgt ggttgtcgcg cgcgctccct gtgggaatgg 660 

ctgcagaggt tccggggctc cagctccgct gtcccctgtg tgtcccctgg gctgcggcac 720 

ggccaggacc tgaagctgct gagggccgag gacttccgga actgcacggg accagcgtcc 780 

ccgcaccaga tcaagtcaca cacgctcacc accaccgaca gggccgcccg caaggaacac 840 

cactcacccc acggccccac caggagcaag ggccacccgc acggcccccg gcccggccac 900 

aggaagccgg ggaagaactg caccaacccc aggaaccgca atcagatctc taaggcgggc 960 

gccgggaaac aggcccccga gctgccagac tatgccccag actaccagca caagttcagt 1020 

tttgacatca tgcctacggc ccggcccaag aggaagggca agtgtgcccg caggaccccc 1080 

atccgtgccc ccagcggggt gcagcaggcc tcctcggcca gttccctggg ggcctccctc 1140 

ctggcctgga cactggggct ggcggtcact ctccgc 1176 


<2 10> 14 

<211> 392 

<2 12> PRT 

<213> Homo sapiens 

<4 00> 14 


Glu 

1 

Gly 

lie 

Pro 

Val 

5 

Asp 

Ser 

Glu 

Arg 

Val 

10 

Phe • 

Leu 

Gin 

Asn 

Asn 

15 

Arg 

He 

Gly 

Leu 

Leu 

20 

Gin 

Pro 

Gly 

His 

Phe 

25 

Ser 

Pro 

Ala 

Met 

Val 

30 

Thr 

Leu 

Trp 

lie 

Tyr 

35 

Ser 

Asn 

Asn 

lie 

Thr 

40 

Tyr 

lie 

His 

Pro 

Ser 

45 

Thr 

Phe 

Glu 

Gly 

Phe 

50 

Val 

His 

Leu 

Glu 

Glu 

55 

Leu 

Asp 

Leu 

Gly Asp 
60 

Asn 

Arg 

Gin 

Leu 

Arg 

65 

Thr 

Leu 

Ala 

Pro 

Glu 

70 

Thr 

Phe 

Gin 

Gly 

Leu 

75 

Val 

Lys 

Leu 

His 

Ala 

80 

Leu 

Tyr 

Leu 

Tyr 

Lys 

85 

Cys 

Gly 

Leu 

Ser 

Ala 

90 

Leu 

Pro 

Ala 

Gly Val 
95 

Phe 

Gly 

Gly 

Leu 

His 

100 

Ser 

Leu 

Gin 

Tyr 

Leu 

105 

Tyr 

Leu 

Gin 

Asp 

Asn 

110 

His 

lie 

Glu 

Tyr 

Leu 

115 

Gin 

Asp 

Asp 

lie 

Phe 

120 

Val 

Asp 

Leu 

Val 

Asn 

125 

Leu 

Ser 

His 

Leu 

Phe 

130 

Leu 

His 

Gly 

Asn 

Lys 

135 

Leu 

Trp 

Ser 

Leu 

Gly 

140 

Pro 

Gly 

Thr 

Phe 

Arg 

145 

Gly 

Leu 

Val 

Asn 

Leu 

150 

Asp 

Arg 

Leu 

Leu 

Leu 

155 

His 

Glu 

Asn 

Gin 

Leu 

160 

Gin 

Trp 

Val 

His 

His 

165 

Lys 

Ala 

Phe 

His 

Asp 

170 

Leu 

Arg 

Arg 

Leu 

Thr 

175 

Thr 

Leu 

Phe 

Leu 

Phe 

180 

Asn 

Asn 

Ser 

Leu 

Ser 

185 

Glu 

Leu 

Gin 

Gly 

Glu 

190 

Cys 

Leu 

Ala 

Pro 

Leu 

195 

Gly 

Ala 

Leu 

Glu 

Phe 

200 

Leu 

Arg 

Leu 

Asn 

Gly 

205 

Asn 

Pro 

Trp 
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Asp 

Cys 

210 

Gly 

Cys 

Arg 

Ala 

Arg 

215 

Ser 

Leu 

Trp 

Glu 

Trp 

220 

Leu 

Gin 

Arg 

Phe 

Arg 

225 

Gly 

Ser 

Ser 

Ser 

Ala 

230 

Val 

Pro 

Cys 

Val 

Ser 

235 

Pro 

Gly 

Leu 

Arg 

His 

240 

Gly 

Gin 

Asp 

Leu 

Lys 

245 

Leu 

Leu 

Arg 

Ala 

Glu Asp 
250 

Phe 

Arg 

Asn 

Cys 

255 

Thr 

Gly 

Pro 

Ala 

Ser 

260 

Pro 

His 

Gin 

He 

Lys 

265 

Ser 

His 

Thr 

Leu 

Thr 

270 

Thr 

Thr 

Asp 

Arg 

Ala 

275 

Ala 

Arg 

Lys 

Glu 

His 

280 

His 

Ser 

Pro 

His 

Gly 

285 

Pro 

Thr 

Arg 

Ser 

Lys 

290 

Gly 

His 

Pro 

His 

Gly 

295 

Pro 

Arg 

Pro 

Gly 

His 

300 

Arg 

Lys 

Pro 

Gly 

Lys 

305 

Asn 

Cys 

Thr 

Asn 

Pro 

310 

Arg 

Asn 

Arg 

Asn 

Gin 

315 

lie 

Ser 

Lys 

Ala 

Gly 

320 

Ala 

Gly 

Lys 

Gin 

Ala 

325 

Pro 

Glu 

Leu 

Pro 

Asp 

330 

Tyr 

Ala 

Pro 

Asp 

Tyr 

335 

Gin 

His 

Lys 

Phe 

Ser 

340 

Phe 

Asp 

lie 

Met 

Pro 

345 

Thr 

Ala 

Arg 

Pro 

Lys 

350 

Arg 

Lys 

Gly 

Lys 

Cys 

355 

Ala 

Arg 

Arg 

Thr 

Pro 

360 

lie 

Arg Ala 

Pro 

Ser 

365 

Gly 

Val 

Gin 

Gin 

Ala 

370 

Ser 

Ser 

Ala 

Ser 

Ser 

375 

Leu 

Gly Ala 

Ser 

Leu 

380 

Leu 

Ala 

Trp 

Thr 

Leu 

Gly 

Leu 

Ala 

Val 

Thr 

Leu 

Arg 










385 390 


<210> 15 
<211> 143899 
<212> DNA 
<213> Homo sapiens 

<22 0 > 

<22 1> modi f ied_base 
<222> (2044) . . (2144) 

<223> a, t, o r g, other or unknown 

< 220 > 

<221> modified__base 
<222> (6609) 

<223> a, t, c r g , other or unknown 
< 220 > 

<22 1> modified__base 
<222> (6625) . . (6724) 

<223> a, t f 0, g, other or unknown 
< 220 > 

<221> modif ied_base 
<222> (14153) . . (14252) 

<223> a , t, c, g r other or unknown • 

<22 0 > 

<221> modified base 
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<222> (19512) . . (19611) 

<223> a, t, o r g r other or unknown 

<22 0 > 

<221> modified_base 
<222> (22595) . . (22694) 

<223> a, t, Of g, other or unknown 

< 220 > 

<221> modified_base 
<222> (27825) . . (27924) 

<223> a, t , o, g r other or unknown 

< 220 > 

<221> modified__base 
<222> (34953) . . (35052) 

<223> a, t, c, g, other or unknown 

< 220 > 

<22 1> modified_base 
<222> (40783) . . (40882) 

<223> a, t, c, g, other or unknown 

< 220 > 

<22 1> modif ied__base 
<222> (49000) . . (49099) 

<223> a, t, Of g F other or unknown 

< 220 > 

<22 1> inodif ied_base 

<222> (62884) . . (62983) 

<223> a, t r 0 , g, other or unknown 

<22 0 > 

<221> modif ied_base 
<222> (75528) .. (75627) 

<223> a r t. Of g, other or unknown 

<22 0 > 

<221> modified_base 
<222> (87944) . . (88043) 

<223> a, t r Of g r other or unknown 

<22 0 > 

<221> modified_base 
<222> (111030) . . (111129) 

<223> a, tf c f g, other or unknown 

<400> 15 

aagcacatac aggtgacatt acagaactga cagttatgcc aggcactgta cttagcccct 
ataccatcct caaacagctg tatgatgtag attgggtatt aaccccatta ataacaaaag 
tacagggaac aaagtgactt tccaaaggtc atgccattca aaggagggtg aatcttaggt 
tggacgcagg ctgtctgact ctggagtctg aggtgttaat gctgcctcct ccatgggaac 
agcccaagtg aaaaacagct gatccactct tcatttactt ggcatctgtg ctaagctggt 
ccctgagcca agctctgagc aacagaaaca gaagctctgc attaggagct tgtgagcatg 
tcaatgccgg gtaaaggagt gctggaaacc gctgggatgg ccgccgagca ctaggccgtt 
gaaggtgggc tctgtgtgac tggttcctct acactctggc ctggctgcct gcaggaagaa 
gatcaagctg agtgggctgg ccctggacca caaggtgaca ggtgacctct tctacaccca 
tgtgaccacc atgggccaga ggctcagcca gaaggccccc agcctggagg acggttcgga 
tgccttcatg tcaccccagg atgttcgggg cacctcagaa aaccttcctg agagtgagtg 
tctggtcaag gtgccggcct tgggggatag tgatggtggg tcctcatatt cagtgagcac 
tcatggttga gtatttattc gcacccctct tcagtcctta caacacccca tgatgtaggt 
ggggcatgct cctcatttac agatgggcac atcaaagctc agctaacgct gggaagttca 
gattcagggt taccctgctg gattcctggg attggggagg gaggagcttc caaaatgggg 
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acaaggtctc tgggcctgtc gggtagctgg tttcctcagg gccccttgca acctctgagc 960 
ttattgcatc aggtgcagcc aggcccgtga gcctcctggc aggggtcctc cacacctggc 1020 
tgtcttttgc cccctgctgg tcacaggagg agctgcagca cctgcctggg ctgcttctca 1080 
ggagggtaca tgaagatccc aggaccgcca gctccatgat aagtggaagg agctccttgg 1140 
agtcaggagc gggagttgag gagtttgagt cctgctctcc agttataggc tatgtgactt 1200 
gtgtagatca cctaaccttg ctcttgattt ccttacctct taaactagca ctaaaagcac 1260 
ccc acaaact gtaaagttag ttgtgatgat tgaatgacac catgggtgtg gaagctcttt 1320 
gtaaagtgca aaacggtgtg cagtttgagg gtggttaccc ccagtgccga ttctcagagg 1380 
gcaacatggc taagggcacg agctggagtt aggctgacct gctgcttcca gccctgtgag 1440 
cttgagcaag tcatttaact tcctgagctg cagtttcctc atcagtaaaa tgtgataagg 1500 
atagggttgt tgtaagattt tattaaatgg ggtaataaat gtcaagtatg tagcccatag 1560 
tgagtgcttc agagtttttt tcttttgttt ctttcccccc cgccccgaga tggagcctta 1620 
ctctgttgcc caggctggag tgcagtggca tgatcttggc tcactgcaac ctccgcctcc 1680 
cgggttcaag caattctcct gcctcagcct cccaaatagc tgggactaca ggcgtgcacc 1740 
accatgctcg gctaattttt gtatctttag tagagacggg gtttcaccat gttggccagg 1800 
ctggtctcga actcctgacc tcatgatgct cctgcctcag cccccgaaag ttttgggatt 1860 
acaagtgtga gcccccgtgc cc tgccaggt tttttttttt tttttttttt tgtaaaacac 1920 
ccacagggta ttgctgttgc ctgggctgga gtgcggtagt gcaatcatag ttcactgcag 1980 
ccttgacctc ctgggctcaa gtgatcctcc tgcctcagcc tcctgagtag ctgggaatac 2040 
aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttttgta ttttaagtag 2160 
agacagggtt ttcccaatgt tggccaaggc tggtctaaaa ctcccaacct caggtgatcc 2220 
acccacctca gcctcccaaa gtactgggat tacaggcgtg agacaccgtg cccagccagg 2280 
aggcttattt tcttgataaa ttacccagtc tcaggtattt ctctacagcg atgcaagaac 2340 
agcctaatac atccaggctc agcatcagtg gacccaggtg ggagagctta agatgtcaag 2400 
gtctgaatgc cgcttccaca cacctttggg acctagggac tccctctctt tttctttttt 2460 
cagtagaaga tgttatcttc tcctttctct gaccagtagt- tggtgatggt ttcagagata 2520 
gtttttcagt caagatatat ttcagtggct tcactgagcc caagttccct cgcctctcta 2580 
ggactttatt tccttgtttc tagaagaggg ataacacata ttttctaagg tggttgtgag 2640 
attaagggag ctggtaccgg gtggtgcata aggacaggat agagcaatgg tgagaccact 2700 
caaaaagcga aaagttgacc tgcgagggtg acacttatca aatcagcaca cagtgggagt 2760 
ggaaggaatg tccctcatca gttacaatat ttggagagtg caagttatag aaaacccagc 2820 
cctggccggg cgcggtgggt catgcctata atcccagcac tttgggaggc tgaggcaggt 2880 
ggatcacgag gtcaggagtt caagaccagc ctgaccaacg tggtgaaa cc ccacctctac 2940 
taaaaataca aaattagctg ggcgtggtgg tgtgtgcctg taatctgagc tactcaggag 3000 
gctgaggcac gagaatcact tgaaaccggg aggtggagtt tgcagtgagc cgagatcgca 3060 
ccactgcact ccagcctggg caacagagcg agactccatc tcaaacgaaa aaaaaaaaag 3120 
aaagaaaacc cagctctaac tggcttaaac agtaagaaga tctattatat tatccatctc 3180 
aggcagcagc aagcccagag gtaggggact ccaaggttgg ttgatccagg gcttaacgat 3240 
gtcatcaaag acccaggttc tttctgtctc ggcacctctg tctgcagggc cagcttcatc 3300 
ctaagccaga ttgttcttgt cttgattaca agttggctgc tgggccagca gacgctgcct 3360 
gcctccctgt tcatcttcag aagtagaaag tggcccttcc ccagtcatgg aatgaaagag 3420 
tttcctttct gtctgggatt gcttaggtcc acccacctga agccaatgac tgtcaccagg 3480 
aaggtaatat acactgattg tcttaagtca gggttcctga gccagtcttg ggcaaggagt 3540 
gtgatactgt catgattgtc ttgggctcat cagggcagct ctgcagatga gatcaaactc 3600 
caagctacat tattctgaac agtgggaagt aggaaagaga cattttggga gatacaaaac 3660 
acaatgtcta tcccatatcc ctaggtccag gtcacagtgt cttggttgga catcaaatgt 3720 
agaaaaagaa agactgtcca tccatttatc tacctattca tctggttttt gatttttttt 3780 
aaattttatt ttaagacatt ctcactctgt cacccagact ggagtgcagt ggtttgatca 3840 
tggctcatgg cagcctcaac ctcccaggct caagtgaccc tcccatgctc aagtgatcct 3900 
cctacctcag cctcccaagt agctagaact aaaggtgcat gccaccacgc tcagttaatt 3960 
tttgcatttt ttgtagagat ggggtttcgt catgatgccc atgctagtct ggaattcctg 4020 
aactcaagca atatgcctgc ctttgcctcc caaaatgctg ggattgtagg catgagccac 4080 
tgctcctggc tcatctgttt aataatttat gaaacaacta ctgggtgctg agcacggggc 4140 
caggggctgg agatctagca gggaccaggc agatctctgc caagtcgttg gtttcttaaa 4200 
ggttttgctc ataattcccc ttttcttttc tctttcgttt tttttctttt ctttctttct 4260 
ttctttcttt tttttttttt gagacagagt ctcactctgt tacccaggct ggagtgcagt 4320 
ggtgcgatct cagctcactg caacctctgc ctcctgggtt caagcgattc tcctgcctca 4380 
gcctcccgag tagctgggac tacaggcgcc tgccaccatg cccggctaat ttttgtgttt 4440 
ttagtagaga ctgggtttca ccatattggc caggctggtc ttgaactcct gaccttgtga 4500 
tccgcccgct tcggcctccc acagtgctgg gattacaggc gtgagccacg gcgcccagcc 4560 
agtttccctt ttcaatgagg cctccctgac ctccatactc tactcctcca cctggcccac 4620 
tcagctctac tttttcttcc ccatagcact caagacctcc taacatacta cgtaagttat 4680 
ttatttacta ggcttactgt gtattgtctg tcttcctcta ctagaatgta aactccatga 4740 
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gaatagaaat ttttgccttt ttatttagtg tggtgtctgc agcccctggc ttagtccctg 4800 
gcatacaaca gtcactccac ccacagttgc tgaataagtg actaaaggtc cctgccctca 4860 
tattgttatg agggagtgtg catgttgtta gagaaaaatc tgaggcacaa taaaatttta 4920 
tagagtttaa gttttctttt ttaagcaatc cacgaattgg ggtagtttca gaggtagttt 4980 
ttcagtcatg acgtatttca atggcttcac tgagcccaag ttctttcacc tctctaggac 5040 
tttatttcct tatttctaga acggggataa cacatagttc ataaggcagt tatgagagta 5100 
agggagctgg tatggggtga tgcataagga caggatagag cagtggtgag accgctcaga 5160 
tgacaaagcg tcagagacca gtatttacga cggaaatgtg gaagcatgat aaagaaatta 5220 
tttgggctgg gcacaatgac tcacaactaa taaaactttg ggaggccaag gtgggaggat 5280 
cacttgactt gcagaaggtc aaggctgcag tgagctgtga ttttgccact gcactccagc 5340 
ctggtcaaca gagtgagacc ctggctcgaa acgttatttg attggttaca gttatacagt 5400 
tgccttattt ggtctattcc atttgaaagt tcctagttct ataattttaa gtttgttggc 5460 
tgtttctgat tggttaagct taagttttgt tttcctttaa tacagttaag tgccccataa 5520 
tgacattttg gtcaaggaca gaccacatat acagtggtgg tcccataaga ttataatgga 5580 
gctgaaacat tcctattgtc tatggcgtag tggtcctgat gttgtagcgc aatgcattag 5640 
ttatatgttt gtggcaatgc tggtgtaaac acacctactg cactgccagt gatataaaag 5700 
aatagcacat acagttatat atagtacata atatctgata atgataatac ataactatat 5760 
tactggttta tatatttact atattattta tctttatttt atttttgaga cagagtctca 5820 
ttctgtcacc caggctggag tgcagtggcg cgatcttggc tcaccgcaac ctccgcttcc 5880 
tgggttcaag tgattctcct gcctcagtct cctgagtagc tgggattaca ggtgtgcacc 5940 
atgacaccct gctaatatgt tttgtatttt tagtagagat ggggtttcac catgttggcc 6000 
aggctggtct tgaactactg acctcaagtg atcaccccgc ctcggcttcc caaagtgctg 6060 
ggattacagg cgtgagccac cacgcatggc ctatttataa ttattttaga gtgtacgcct 6120 
tatacttata aaaaaaagct aactgtcaaa cagcctcggg caggtccttc aacagatatt 6180 
ccagaagaca ttgttatcat aggagatgac agctccgtgc atattattgt ccctgaaaac 6240 
cttctagtgt ggaagtggaa gacagtgata ttgatgatag gacccagtgt aggcctaggc 6300 
taatgtgtgt gtttgtgtct ttgcttttaa caagaaagtt- taaaaagtta aaataaaata 6360 
caaaaatttt taaatagaaa aaagctgccc aggaacaatg gctcacacct gtaatcccac 6420 
cattcgggga ggccaaggtg ggtggattgc ttgagctcag gagttcaaga ccagcctggg 6480 
caacatggtg aaaccccatc tctacaaaaa atacaaaaat tagccgggtg tggtggcatg 6540 
cggctatagt tccagctaat cgaggggctg aggtgggagg atcactgggg gggaggtggt 6600 
tgaggctgna gtgagctgtg attgnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6660 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6720 
nnnnatattc ttaaaaaaat ttttttttat ttttgagaca gaatttctct cttgttgccc 6780 
aggctggagt gcaatggcgc tatctcagct cagggcaacc tccacctcct gggttcaagc 6840 
gattctcctg ccttagcctc ccaggtacag gcgcccgcca ccatgctcgg ctaatttttg 6900 
tatttttagt agagatgggg tttcaccatg ttgtccaggc tggtcttgaa atcctgcctc 6960 
aggtgatcca cccccctcgg cctcccaaag tgctggaatt tacaggcgtg agccactgtg 7020 
cctggcctcc tttacatttt tttaaattta attttaattt tttaattttt aatttctcat 7080 
atatatatat ttttaagact agccaagtga agcagtggga gtggaaaagg aactggtttt 7140 
gatcaatagg tgtaaacacc actgcactgg gaccagccta ttttacattc ctgttagcag 7200 
tgatgagggt tcactttctt tgtagcctca acaatatgtg tcgttgccca tctttttttt 7260 
tttttttttt tttttttttg agatggagtc tcactctgtt gcctaggctg gaatgcaatg 7320 
gcatgatctc agctcactgc aacctccgcc tcccaggttc aagt'gattct tgtgtctcag 7380 
cctcctgagt agatgggatt acaggcgt cc accaccacgc ccggctaatt ttttgtattt 7440 
tcagtagaga tggggtttca ccatgttggc caggttggtt tcgaactcct gacctcaagt 7500 
gatccgccca cctcggcctc ccaaagtgct gggattacag gcatgagcca ccgcgcccgg 7560 
cctgcccatc ttttttttgt tatagccatc ctagtggatg taaagttttt ttgtgatttt 7620 
gatttgtgtt tccctactga tcaatgatgt tgagcatctt ttcctgtgct tattggcttt 7680 
tggtatatct ttggagaaag gtctattcag gtcctttgcc cactttaaaa ttaggttatc 7740 
tttctattac tgagatgtaa gagttcttta tgttctagat ataagtctcc tacatatgat 7800 
ttgtaaaaat tttccttcca ttattgggtt gtctttcact ttcttttggt gtcctttagt 7860 
gcacaacagt ttttaatatt gaagtccaat tttctatttt tctcttttgc cacttgtatc 7920 
ttggtgtcat gtttaaggaa ctattgccta atctcaggtc acaaagattt acacctgtgt 7980 
ttccttcttt ccttccttcc ttccttcctt ccttctttcc ctccctccct ctctccctcc - 8040 
ctccctctct ccctccctcc ctccttccct tcctccctcc ctccctcctt ccttccttcc 8100 
ttccttcctt ccttccttcc ttccttcctt ccttcctttg tccttctgac ggaatcttgc 8160 
tctgtcaccc aggctggagt gtagtggcac gatcttggct cactgcaacc tctgcctcct 8220 
gggttcaagc aattctcctg cctcagcctc ctgagtagct gggactacag gcacacacca 8280 
ccatgcccag ctaatttttg tatttttagt agagacgggg tttcaccaca ttggccagga 8340 
tggtttcgat ctcctgacct cgtgatccac ccgccttggc ctcccaaagt gctgggattg 8400 
caggtgtgag ccaccatgcc cggcctgtgt tttcttagag ttttgtagtt ttagctctta 8460 
tagttagatc cttgatccat tttgagttga ttttgtatat agtgtgagat atccacctgg 8520 
tgttgtaaat tgcccagaag tgggtatgct tctaaatctg gctgttaggg attactagag 8580 
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gtgaccaaag tgaatttttt ctttgtttct tttttttttt ggagacagag tctccgtcac 8640 

ccaggctgga gtgcaatggc ttcatcttgg ctcagtgcaa cctctgcctt ctggtttcaa 8700 

gcagttctcc tgcctcagac tcctgagtag ctggtattac aggcgtgtac caccatgctt 8760 

ggctaatttt tgtattttta gtaaagatgc agtttcacct gttggccagg cttttctgga 8820 

actcccggcc tcaagtgatc catctgcctc tacctcccaa agtgctggga ttacaggtgg 8880 

gagccaccgt gcccagtcct tttctcagaa tttatttgtt tttttttgtt ttgtttcatt 8940 

tttgagatag ggtctcactc tgtcagctag gcaggagttc agtggtgtga tcattgctgc 9000 

agccttgaac ttctggactc acgtgatctt cccacctcag cctcctgagt agctaggatt 9060 

acaggcatgt gcttccacac ctggctaatt ttttaatttt ctaggactta tttgtccatt 9120 

cttgcaaagc agggtacaac atgcctatct ctacctacct ctcttccctt caagggactc 9180 

cagccaaaat ccttgaggct ctcgggctga ctgtgggtgc tgttgcctga tctgcctcag 9240 

tcatgctgca tgatcaaaag tgtccgtttt ctgcttcttg gaactttatt cactttgggt 9300 

gtcagtcttc ctctgcagtg tcccaagaac acagaattag accaggaatc tgtgttgcca 9360 

tagtgtgtgg aaagaggcag acttccaact ccgctatgtg ctgttgggtg attgaagctt 9420 

aattttcttt ctatctttct ttcttttctt ttcttttttt ttttttggag atggaatctc 9480 

gctctgttgc ccaggctgga gtgcagtggt gcgatctcac ctcactgcaa cctccgcctc 9540 

ccaggttcaa gcgattctcc tgcctcagcc tcctgagtag ctgggattac aggtgcatgc 9600 

caccatgccc ggctaatttg tgtaatttta gtagaaacag tgtttcacca tattggtcag 9660 

gctggtctcg acctcctcac ctcaggtgat ccacccgcct tggcctccca aagtgtcggg 9720 

attacaggcg tgagccaccg tgcctggcac ttaattttct taatacctca attaccccat 9780 

atggtaaaat gggactagta atccatacct tatagcgctg ttgtgaaaat gaaatgaggg 9840 

taagcagata aaatttcaga ctacggatgg gattgttact acattctgaa cctggctttg 9900 

ctgttatttg ctatgtgacc ttatcttctc tggatctcca ttctttccaa gtctataaaa 9960 

caaagtggac aattgtcaac ctttcttcca aagagcaatg atttaaggat caaatgatgt 10020 

catttaacaa aaatatgaag agctcaacaa atgaggaact cattattatt attacaatta 10080 

ttattatttt agaaataggg tcttgttctc ttgcctaggc tggagtccag tggtataaac 10140 

acagctcaat gcatcttcag cctcctggat acaagtgatc ctcatgtctc atccccctaa 10200 

gtagctggga ccacaggcat gtaccaccac gcacggctaa ttttttattt tttattttta 10260 

ttttttgaga cagtcttgct ttgtcgccca gactggagtg cagcagcgca atcaccgctc 10320 

actgcaacct ccgcctcctg ggttcaagtg attctgctgc ctcaacctcc caagtagctg 10380 

ggattacagg cctgtgccac catgcccggc taattttttt gtatttttgg taaagacggg 10440 

gtttcaccat gttgcccagg ctgatctaga acccctggcc tcaagtgatc cccctttctt 10500 

ggcctcctaa agtgctagga ttacaggcgt gagcctctgc acctggcctc ggctaatttt 10560 

ttattttttg tagagacagg ttctcactat gttgccaggg ctggtcttga actcctgggc 10620 

tcaagtgatc ttcccacctc agcctcccaa agtgctgaga ttacagatgt gagccactgt 10680 

gcctggcctg gaactcatta ttgaagcatt cactagtatc aactttgggg ttacctggcc 10740 

acatcctctg acctacctat aagggtatca cagctaacgg agcctctgtt tctcagaatt 10800 

taggcagaag cagttcaatt tatcacaaac tactctatat ccagcataag tgcccaaata 10860 

aaacaattgc taaagttctt taggcattta ctgtttgtta gttagatatt tagtcctcac 10920 

tacaaatctg tgatacaggt attattttta ttaaccccat tttatagaag agaaacctga 10980 

agctcagaga tgctaagtaa cttgtgcaag gtcacacagc tagtaaataa agggcagagt 11040 

aaagatttag tttcacattg gactccagaa cctttctact gggactcatg ggaatagtgt 11100 

ggatgtccct gaccttcagt ggcccagggc tctcctgggg gaatccagcc atagacaaga 11160 

caccagcgag agcccaatcc taagattttg tttgtttgtt tttgagacaa ggtctcactc 11220 

tgtcaccaga ctggagtgca gtggcatgat caatgctcac tgcaaccttg atctcccagg 11280 

ctcaagcaat cctcccacct cagcctcctg agtagcttgg actacaggtg cacaccacca 11340 

cacctgacta attttaaaat tttatttaat taattactta ctattatttt ttgagacagg 11400 

gtatcacttt gtcacccaag ctggactgca atggtgtggt ctcagctcat tgcgtcctcc 11460 

acctcccagg ttcaagtgat cctcccacct cagcctctgg agttgcaggg actgcaggtg 11520 

tgcgccacta tgctcagcta atgtttttat tttttgtata gatggggtct cactatgttg 11580 

ccagggctag tctcaaactc ttggactcaa gcgatcctcc tgtcttggcc tcccaaagtg 11640 

ccgggattac aggcataaac caccacaccc aacccctaag gtgtttttgc tgaatgtgac 11700 

catgtcagag gcaggaaagg gaagcatcat ggggttagga aaggaacact gagcagggag 11760 

acaaagaaaa tgggatcatt ttgtgagtgt tcgctgtgtg tgtatgtgtg acaattctca 11820 

gagccagcct ctcaggtggt tgagaccaca gtccccattt cccagatgag ataatggagc 11880 

ctcagagagt ttctgcagca cagctagtgg aattagaatt tgaacccggc tcttccagac 11940 

tccaggtgct tcacaaccat cccaaaccta gtcatttgca gtttaccttc atgattttac 12000 

catttccctt tgccatagct agtgttattt acttaataat tccttttgaa tcagtctgct 12060 

taaaaaaaaa tagcttcatt ctaaagtgta atattcttgg aatatcgggt ttgctgttac 12120 

ccacccccac acgttataca tatacatgta tgtttctaat acatatatat gtacgtatat 12180 

acgtgtatcg ttttttgtta ttttttttgt tgttgttagt tttttttaga tggagtctct 12240 

ctctgtagcc caggctggag tgcagtggtg tgatttcggc tcactggaac ctctgcctcc 12300 

tgggttcaag cgattctcct gcctcagcct ctggagtagc tgggattaca ggcacccacc 12360 

actacacccg gctaatgttt gtatttttag tagagacagg gtttcaccat gttggccagg 12420 
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tgggtcttga 

ttataggtgc 

aataactatg 

acatgctgtg 

aatcactgtc 

atgccaaaga 

ttgggctggc 

aagccttcct 

gaatgaatgt 

gtggaagagc 

tggcccatgt 

cttgagtttg 

ggacccttac 

attaaatggg 

tgggattgtt 

gtctaattca 

ggtaaagcag 

gtatgacctt 

tgcctagatc 

tcacccaggc 

tcacgccatt 

gcccggctaa 

atggtctcga 

acaggcttga 

tggattttta 

ataagggtca 

gtgcagcctc 

acagtgctgt 

ataaacctgt 

nnnnnnnnnn 

nnnnnnnnnn 

gggtagatca 

ctctacaaaa 

cgggaggctg 

atcatgccat 

aaaaaaaaag 

attggttcct 

cctggcttgg 

tccaccacta 

tggccagtcc 

atagtgatca 

catttgtggg 

gggctgcctg 

ctgtactttt 

tgccgtggcg 

acctcagtct 

gtatttttag 

tcaggtgatc 

gcccggtcct 

ggcatagttc 

agctcttggc 

gtgcagcaaa 

atttcaattt 

agtgtcccct 

cagaccctgg 

ggagaggatg 

ttgccagccc 

cctcaactgg 

atcatctgcg 

cattaaactt 

catattaata 

gtagattttt 

ggctgtggca 

agcactgcac 


actcctgatc 

gagctactgc 

aaacctgaaa 

cataccacat 

ccatcctatg 

ccttttctgt 

tctcctttta 

tatgaatgaa 

atgaattgac 

acatgtcctt 

aacacagagg 

aatctgagcc 

catttataaa 

attctctata 

tgtgagggaa 

gggactgcaa 

ggcaggtgta 

gagaaggcaa 

ttctaacttt 

tggagtgcag 

ctcctgcctc 

tttttttttt 

tctcctgact 

gccaccgcac 

tgtgataact 

acattgccag 

tggtctaaaa 

tgtatagtaa 

agagaggccg 

nnnnnnnnnn 

nnnnnnnnnn 

cctgaggtca 

ataaaaaaat 

aggccggaga 

tgcactccag 

aaaaaagaaa 

gagagtagcc 

gccaatgaat 

gggaggtctc 

tcctacagtc 

aggccgttac 

tcttttggaa 

tccctgctcc 

tttttttttt 

tgatctctgc 

cctgagtagc 

tagagacagg 

cacccgcaca 

accaatctgc 

tgagaagcat 

agagacatct 

ttttgtctag 

ttaagaaatg 

taatttctta 

ggtgacagaa 

atctacagag 

tgaagctatg 

ggccattttt 

ggggtgggga 

tctacaacgc 

gcacaaagtt 

cacacattaa 

ggaggattac 

tccagcctgg 


tcaagtgatc 

ggctggccaa 

aactgctcca 

attgggaaac 

cttcgcttta 

tgtcattaac 

gttcagaatt 

ctcgttggtc 

actaagtcat 

tctggcactg 

ccctttggtg 

tgcctgtaag 

atggggacca 

gaaaatattg 

attgtcattt 

atggaaatgc 

gaatagcaat 

gccctggcaa 

ttatttttat 

tggcgc:gatc 

agcctcccaa 

gtatttttag 

ttgtgattta 

ctggccgatc 

cctgatttta 

cagagcaaaa 

tcatctttgt 

atatccagta 

ggcacagttg 

nnnnnnnnnn 

nnnnnnnnnn 

ggagttcaag 

tatctgggca 

attgcttgaa 

cctaggtgac 

agaaaaagaa 

aaccttattg 

gaggaagaat 

aagcttgaaa 

aagtagataa 

tctgcaatca 

gagcagagaa 

ctgagccatg 

ttttttgaga 

tcacagcagc 

tgggattaca 

gtttcactgt 

tgcctcccaa 

acattttaat 

cccacaagga 

ggcaaattca 

agcttggtca 

ttgcattaaa 

gtgtgtctga 

gactggtggg 

tgctggagag 

ccttgaggtt 

ctccccagag 

gaggggctac 

ccaggacagc 

gaggaacctt 

agatgagaaa 

ttgagcccag 

gtgacagagc 


cacctgcttt 

tgtatgtttt 

tgttacttcc 

actgctttcc 

tttgtctata 

atggacacag 

tggattttta 

aaactcataa 

tagcatttca 

atgtgtgctt 

cagtggaaaa 

aagctggcta 

ttgtactcac 

gcacaaagta 

gattatcaaa 

cagctgaggc 

gtctgggaat 

aaggggatgg 

ttttattttt 

tcggctcact 

gtagctggga 

tagagatggg 

ccctccttgg 

ttctaacttt 

aactggcacc 

catgggtggg 

atttcttctt • 

agtgggtgta 

ctcatgtctg 

nnnnnnnnnn 

nntagtccca 

accagcctgg 

tgattgcagg 

cctgggaggc 

ggagcaagat 

acaatgaatg 

gtcatcccag 

aatggcaaat 

tccagtgacc 

gtggagggtt 

gactcagaaa 

gaggaaggaa 

gagggctggg 

cagagtctcg 

atctgcctcc 

ggtgcccacc 

gttgggcagg 

agtgctggaa 

tgacaagggt 

tgcctctgag 

aggcttgaga 

gttaaaattt 

atgttattta 

gttgagagcc 

agatgggagg 

actctgtatg 

gggcaaggtg 

gacatttgga 

tgacatctgg 

ccctccacaa 

gatctagatc 

gcttgtgcct 

gaattcaggg 

aagattttgt 


ggcttcccaa 

taatacacat 

tgaacccatc 

ctggcttcca 

aatgttgggg 

ctctaagagg 

tccaactact 

aaggctgatc 

cgggaatgga 

gggaaactta 

ctgttgactt 

actgaattgc 

cctttagggt 

ggtgtaaatt 

gacttaggag 

caggcatttg 

taaaagagag 

cctccactca 

tgagacggag 

gcaagctccg 

ctacgggcgc 

gtttcaccgt 

cctcccaaag 

ttaaagagaa 

caattataat 

ggcaactgct 

gctttacgca 

gaatgaataa 

taatctcagc 

nnnnnnnnnn 

gcactttggg 

ccaatatagt 

tgcctctaat 

ggaggttgta 

tctgtctcaa 

agtgtgaggc 

ccacgaagtg 

gggggtctat 

aggtttttag 

tggtccgttg 

tggcctctca 

tttaggtctt 

gtggaatatg 

ctccgtcgcc 

cgggttcaag 

accacgcccg 

ctggtctcaa 

ttacaggcat 

caccctccac 

ttcaccctga 

catgctggcc 

tgatgttttg 

tcttgaatag 

tcccctgcct 

tgaggagggg 

gagcttttca 

gcatatccta 

aacatgtgga 

tgagtagaga 

taaagagtta 

cacagcacag 

gtaatccctg 

ttacagtgaa 

ctcttaaaaa 


agtgctggga 

tcaaataacg 

ttgagtgctc 

agcccagctt 

ttgggggttg 

tcttggcatc 

cagagtgatc 

gataaaacag 

ttctccgtta 

ctgagctaac 

tggagattat 

tttgcttctt 

tattgcatgg 

tgcacgctag 

caggaacagt 

ctaataattg 

gtgaggacgt 

gctacagtca 

tcttgctctg 

cctcccgggt 

ccaccaccat 

gttagccagg 

tgctgggatt 

gcaagacatc 

ttacaacact 

ggtcaccggt 

ttgtcccagc 

accaatgcag 

acnnnnnnnn 

nnnnnnnnnn 

aggccaaggt 

gaaaccccgt 

cccagctact 

gtgagccgag 

aaaaaaaaaa 

tcatggtagt 

aaatggtacc 

gcctccaccc 

gtcctggacc 

ggctacggag 

gttacttctc 

ctcaccctct 

gggaataaat 

caggctggag 

ttattcttcc 

gctaattttt 

atacctgacc 

gagccaccgt 

tcatgtgcca 

caagtccact 

tctctttaaa 

ttttgcatta 

taaatttctt 

gattctagtc 

agtgttggtt 

tgctgcctgt 

gatcagagat 

gacatttttg 

ccagagggac 

tttgacctca 

aagaaaggat 

tgactcagga 

ctatcatcgc 

aaaaaaagat 


12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 
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gaggacaggc 

gatcacgagg 

aaaaatacaa 

gggaggtgga 

gcaagactcc 

agagagagag 

cttccacttc 

tgcctctgtg 

gtgttgcaca 

cctcttgttt 

tgggattcca 

ttttaggggg 

gaagcaaacc 

gcagtcccag 

tcctttgtcc 

acccagtata 

gactcttgga 

agaagcagca 

agcagacctg 

cggagactgg 

tagactctcc 

taatcaaaca 

acaaagacag 

caatgcctct 

aagtctctcc 

ccccccgctt 

cagctcatct 

tttgtttgat 

tgttttgaga 

tcaccacaaa 

tgggactaca 

gtttcaccat 

cctcccaaag 

ctttcagaag 

ctcacacgtg 

ttcttgtcca 

ctccagtctt 

ccacggtgct 

aatgatgccc 

agttcaaagg 

gaggaggtca 

gacccagctg 

cagggaagtc: 

gcagaagctc 

cattcagtct 

agaatcacct 

atttaggagg 

atgatattgt 

gcctgtgtta 

cctttcaaag 

agttgtggtg 

gtaataaact 

attgtcctga 

ttttaacctg 

agagggaatt 

nnnnnnnnnn 

gttagcacat 

tggaggcaat 

tcaggaaagg 

gatgaggctt 

agtttggatt 

tgacaatgac 

cagaagatgg 

ggggacgatg 


acagtggctc 

tcaggagttc 

aaaattagcc 

gcttgcagtg 

gtctcaaaaa 

gaaagaaaga 

ccatggaacc 

aagtaggggt 

ctatgcccct 

cttcccccca 

ccagcaggag 

cataaataat 

agggtgtggg 

cagagagcag 

cccactctgt 

aaggctgctt 

aggtgagttc 

cctcataggg 

tccccgcagg 

aagacaaatg 

agctcagggt 

attaaacaaa 

ccccaagaag 

tgctcctgcc 

cttccctcct 

ttatttgcac 

ccggtaagat 

ttttttttgt 

cggagtctcg 

ctccgcctcc 

ggcacatgcc 

atttgccagg 

tgctgggatt 

gaaaaaccac 

ctttctccca 

tttgtgggtg 

gtcttcaggt 

ggaactagga 

cccatctgga 

agcaggagag 

aggcttctgc 

ccttagacaa 

caaggtgcaa 

aggtaagtag 

tctgttccct 

ggagttgttg 

gctgaggtga 

tggtctgggg 

aattctggtt 

ctttagcacc 

agcattaaat 

ctcaataagc 

cagctaatta 

aaaagggatg 

cnnnnnnnnn 

nnnnnnnnnn 

agccagaaca 

ggcacttgca 

agagcttcag 

ggaggctggg 

ttctcctact 

acgtttgcgt 

aagataaagc 

tgagtgacag 


atgcctgtaa 

aagaccagcc 

agctacttgg 

agccaaaatc 

gaaaaaaaaa 

gagaaggttt 

ttggcaagtt 

ggtacctggt 

aggatgggtg 

ttgtctccca 

gctactcctg 

aattatgcct 

gtacccaaac 

ggcacagggt 

cctggagcaa 

ctggcccctg 

ttttctggct 

caaacacgta 

ggcaggagtc 

aaatttggta 

ctgttcatgg 

tattccctct 

gacggtggag 

ctttctcctg 

catgcgtctt 

tcatgaggcc 

atcacttgga 

tgttgttgtt 

ctctgttgcc 

cgggttcaag 

accacgcctg 

ctggtctcaa 

acaggcgtga 

ccacctgtta 

ccttcaaaac 

tgacccgaga 

ggaattccta 

agctcttgac 

gagcctagag 

agacttggct 

ctgtcaagac 

ggacttccag 

gatctgccgc 

tagggaggct 

attcatttag 

ttaaaacaca 

agcctgagaa 

agaaccaccg 

ctgcctttta 

ttcatttgta 

gagataatac 

ggtggttata 

agaagcaaaa 

ccgggctctt 

nnnnnnnnnn 

nnnnnnnnnn 

tctagaaggg 

aaggctttgt 

cttcagtgtg 

cagattttgc 

aaggacaaca 

tttggtggct 

actaaagggt 

taaatcattt 


tcccagcatt 

tggccagcat 

gaggctgagg 

ttgccattgc 

aaagatgaga 

tggagtcaaa 

gccttctctc 

ggagatgatg 

agagcttggg 

ccagctccct 

gagttaaggc 

ggacaatgga 

aaggcagtag 

tgagaagact 

tcattctggc 

aagccagagg 

gcccaggcag 

ggaggcctgt 

tgggctaagg 

cctgagttat 

caagaggaga 

atactatgtg 

tagtgtcctc 

ctgggaacag 

tccctttttt 

aggaccacag 

ctcagaactg 

tttgtttttt 

caggctggag 

caattcttct 

gctaatcttt 

actcctaacc 

gccaccgcac 

agatgaaggg 

acacactcgt 

gatctctcct 

ggaaagctcg 

ctgatggcag 

acacaggcag 

ctgacggagg 

aactttgagg 

tgccccaggg 

tacctattgg 

actgcggagg 

gctagtggtt 

gctttctggg 

tttgcattta 

atttaaacaa 

ttagctgtgt 

aaacgaagat 

atgaaaagtg 

attattatga 

gataggatta 

gcctggagac 

nnnnnnnnnn 

nnnnnnnnnn 

gtggtaggag 

tgaagtggcg 

gctggagtgc 

tccaaaagag 

gcaaactatt 

cactcgagct 

gatgaggcag 

gttgggttgc 


ttgggaggcc 

agtgaaaccc 

caggagaagc 

actccagcct 

aagaggaagg 

aagacttaga 

tttctctgaa 

cggagatgag 

agcactgaac 

gggatctcca 

gtgttgttca 

cataacatct 

gccccaggag 

gagcaaactt 

ctcttcccac 

cactgagagt 

gaccagtgta 

ccttaggaac 

gtgatactgg 

ccctcccacc 

aagcaatctt 

ccaggggcta 

gctaaaagac 

tttctgctct 

cctttttcct 

ccttccctct 

taacctggaa 

tttttgtttg 

tgcagtggcg 

gcctcagcct 

gtatttttag 

ttgtgattcg 

ccggcaaact 

ctggtgactg 

ggtgtcggcc 

tacctaacac 

agcgacgttg 

gttacctctt 

acctaggcca 

aggtgatt c a 

atgaggaagc 

aagaagacat 

tgcggactcc 

acctggggga 

ctcaaagcct 

cctcacctgc 

caacaaatcc 

aaggctttgg 

tacctgggca 

atatagcacc 

tttggaatag 

gtattatcat 

agggaggcaa 

tcagaaactt 

nnnnnnnnnn 

nnnnnnnnnn 

tggggattag 

taagtgtgga 

tgggtgtgaa 

cttggtgaac 

gaagagttta 

gccagccagg 

gaagccagtg 

tattgtgtgc 


gaagtgggtg 

catctctact 

gcttgaaccc 

gggcgacaga 

gagagaaaaa 

aattccagtt 

tctcacattt 

ggtgaggggt 

ctccctttcc 

cttcactctc 

gactggggca 

agggccttct 

gcaggtccct 

cattatcagc 

ttccccaaaa 

ggaggtctca 

ggccctggga 

atcatagcta 

agagcagcaa 

attccttttc 

gtttgctctt 

tactagacac 

agtagatatg 

tcatctgggt 

acactcccct 

ttagctgata 

ctttctcttt 

ttttttgttt 

cgatctcggc 

cctgagtagc 

tagagatggg 

cccgccccgg 

gtaacctgaa 

ccccaggctt 

agaagtcagg 

caaggaaatc 

ctggagctgt 

cttcccagag 

ggatctggat 

ggcagaggga 

catggagtcg 

tgttgaagtg 

taaaactttt 

aaagagagta 

cgcatgcatc 

acgacttctg 

ccaggtgatg 

tgttagaaac 

agttgctttg 

aacttcttag 

tcactgggct 

ttcctgtagg 

gtaggtttat 

gaaataaatg 

nnnnnnnnnn 

ngaattctct 

aggttccagc 

ggtggagcat 

gagaggtgaa 

tgtgataagg 

aatcgttcag 

tagacagtgg 

aggagagaaa 

taagctctgt 


16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18540 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 
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gctaaattct 

caatcgttcc 

gcagcatgca 

ctgctctagg 

tcaacaatat 

gatccctatg 

aaaagggtat 

cagcactttg 

ggccaacatg 

cacgcctgta 

gcagaggtta 

accctgtctc 

taaactgcta 

aaagaattag 

taatcccagc 

cagcctgacc 

gtggtgcatg 

cgggaggcag 

agcaagaccc 

ctgcattcct 

acctcccatc 

tgtgtgcctt 

aaatgtcttg 

ggtattcatg 

tctaataaac 

gtgtaaagtc 

ctctctctca 

gcaagcatcc 

ggcagcctct 

tccttcccca 

agagaccctg 

tacatgtagt 

atacatatac 

cctacatcaa 

taggtaaaga 

gtgggaggac 

tctcaactaa 

gctcaggagg 

acggtcacaa 

aaaaaaaaaa 

tggaacttct 

gtttaaagct 

nnnnnnnnnn 

nnnnnnnnnn 

tgaactcctg 

tgtgcaccac 

tatatataaa 

ttttagagat 

tcctcctacc 

gagttatttt 

gagcctggag 

agtcacatag 

gtggaaggtc 

caagagcctc 

gcggatcatc 

ccaggagcat 

ctggctgagc 

aaggtggcag 

caacccgctt 

gtgggatgta 

gaaggggtga 

ccagttgggt 

cccctcattg 

ggcactgtcc 


tcacgtgtat 

cagctgaaga 

agttgtccag 

actgggggaa 

tgttggcctg 

agcaagtggg 

aagaaataac 

ggaggccgag 

gtaaaacccc 

atcccagcta 

cagtgaactg 

aaaaaaagaa 

gtcctcaggt 

gatgtcagct 

actttgggag 

aacatggcga 

cctttagttc 

gggttacagt 

tgcctcgaaa 

gctcggcaac 

actgatgtct 

ctgctctggg 

tggtaggcaa 

cctgtgtata 

agaacacagc 

tgtcttgttt 

ctttcttact 

atgttgtgag 

acccagcaag 

cttgagcttt 

agacagaaca 

tttaagctac 

catgtacata 

gaatgtgtca 

ggccaggcat 

tgcttgagcc 

aaaaaaaaaa 

ctgaggtggg 

aactgcactc 

aaaaggaaag 

attctagagg 

ccttcagcat 

nnnnnnnnnn 

nnnnattttt 

ggctcaagca 

catgcctggc 

tatatatata 

agggtcttgc 

tctgcctttc 

taccacacga 

gcaagtgaat 

ttagtgtcac 

tgsggctcag 

tgctccccca 

ttctacatgg 

ggtgaggcac 

tcccctcacc 

agacaggtag 

ctcagggggc 

agagtctggg 

tagtgtccac 

tctactcctc 

gctacatcat 

tcggagatgc 


tatttcagct 

agctgaggct 

ccaggattct 

atggtcccta 

ctacatgtga 

atgggggtgg 

aattaggccg 

gagggtggat 

atctctacta 

cttgggaggc 

agattgcacc 

aacaaaacaa 

gcatttttgc 

tcttttgaaa 

gctgaggtgg 

aaccccgtat 

cagctactca 

gaaatgagat 

aaaagaaaaa 

agcctgatgg 

tccctcgctg 

tgcccagatg 

cctcaaagat 

atcctctccc 

aagggtaatg 

gtttccctct 

gtcttgtcct 

ctgacctatg 

gaactgagtc 

cagatgacgg 

ctcagctaag 

tatgttttgg 

actgtttcag 

ggcaccattc 

agggctcaca 

caggagttga 

aattagttgg 

aagatcgctt 

tagcctgagc 

aaagaaaaaa 

gggatggttt 

ctggggaaaa 

nnnnnnnnnn 

tagagatagg 

atcctcctgc 

ttatttcata 

atttctgtat 

tatgttgacc 

aaagtgttgg 

agcataggac 

ctgaactttc 

tccccccaag 

gggogccccc 

ctttcctgtg 

ctgctgtgaa 

cgctgaggcc 

atacgtttcc 

ggctatgaaa 

gggactcctc 

ttcagaggaa 

tgagtgccag 

cgtcttcggg 

ggactggcgg 

caggtgacct 


aatccatcta 

ctcaaaagct 

aacttagaca 

gtgagatatg 

agagctgtgg 

agtggacagt 

gccaggggtg 

cacttgaggc 

aaaatgcaaa 

tgaggcacga 

actctactcc 

gtaggtactt 

ttatcagttt 

atttgggcca 

gtagatcacc 

ctactaaaaa 

ggatgctgag 

tgcaccactg 

gaaaatttgg 

tggatgggca 

tcatcatctg 

aattggatgc 

cactggggcc 

ttgagtgtgt 

ggatgctact 

cttgatcttc 

tccctttgtt ' 

aagaggccca 

actcatcata 

cagccctggc 

ctatacccta 

gataatttgt 

ttgtctgaga 

caggtacttg 

tctataatcc 

gaccagcctg 

gcacagtggc 

gagcccagga 

aacagaaaaa 

ataggtaaag 

ttcacagtag 

ggttnnnnnn 

nnnnnnnnnn 

gtcttgctat 

ctcagcctcc 

tatatatatt 

ataaataaat 

accaggtctt 

gattacaggc 

atacatccaa 

ccgtctttga 

aactgcatga 

tgctgtctcc 

gagcctcctc 

caagatgctg 

cctgggggtt 

ctacccacag 

gcagggccct 

taggcctggg 

ggctattttg 

ttcctgaccc 

gccatgcagc' 

atcaaggact 

gcctgtacag 


acaactctgt 

agtaacttgc 

ccagaggcca 

tgtcgagttt 

aaagcgccca 

aggagggctg 

gtggctcacg 

caggagtttg 

aattagctgg 

gaatcacttg 

agcctgggag 

tctgccatag 

taaaaatcag 

gaagcggtgg 

cgaggtcagg 

tacaaaaatt 

gcatgagaat 

cactctagcc 

aagatctgac 

gaggctcagt 

cttgacatgt 

tatatgagaa 

tccaatgatc 

actacacctg 

tctaaggtta 

ctctcattct 

tactctgatg 

tgtggtggta 

tgggtgagct 

tgatgctttg 

tctcctgaga 

tactcagcaa 

ctatatttag 

gaatacatca 

cagcactttg 

ggtaaaatag 

acatgcctgt 

gtttgaagct 

gaccctgtct 

atccttgatt 

aagtctgtgt 

nnnnnnnnnn 

nnnnnnnnnn 

gttgcccacc 

tgagtagctg 

tttatatata 

aaatatatat 

gaactcctgg 

gtgagccatg 

aaatgttctg 

agaaaccagt 

gctgggacaa 

ccagggtctg 

accatgggca 

gagtaccttg 

gggggcacag 

agacaaatga 

ggctcacgcc 

cccacccagg 

gtgctctctg 

cactgccctt 

tgttgtgcct 

gcgtggacgc 

ggatggtgac 


aaggcaggta 

ctaagttcat 

cttttaacca 

catatttcat 

aagtgagtta 

gaacacacat 

cttttaatcc 

agaccagccc 

gctggtggtg 

aacccaggag 

acagagtgtg 

ggaggattca 

agaatgtctc 

ctcacgcctg 

agttggagac 

agctgggctg 

cacttgaacc 

tgggagacag 

aacagttgac 

tgtctgccaa 

aggcatttgg 

aacattctgt 

cctccttcct 

gatacttgct 

aattacaaga 

ctctctctcc 

aagcaagcta 

aggaactgag 

tggagacaaa 

caggcttgtg 

tagagtataa 

tagataacca 

tcattttaca 

attaacagaa 

ggaggcccag 

tgagacactg 

ggtgccagct 

ccagtgagcc 

caattaaaaa 

cttgccctct 

tgacagcgct 

nnnnnnnnnn 

nnnnnnnnnn 

aggctggtct 

ggaatacagg 

tgtatattta 

atatatatat 

gctcaagtga 

gcacctaact 

agctgagcaa 

ctctctccaa 

tcagagggca 

tccccttacg 

tgacccagct 

tgactggtgg 

gcgggtcacc 

acagcaacaa 

caccccactg 

taaccctttt 

gcctccgctg 

cccatcctgc 

tctcacctgc 

cccaactcag 

agcaagtggt 


20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 
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caggcagtgc 

agaaccctag 

agggacctgg 

caccttgcct 

ttgcagttga 

cagctttcag 

gggctagaat 

aggcttggca 

atcagggaga 

gctgtcagcc 

ttcaattacc 

gagcaagagg 

ataccatcac 

ggagactgga 

ctggagacca 

ccc aggctgg 

agcgattctc 

gattctgtga 

aggatgggag 

agcctggccc 

ctagcagggc 

ctctctcctg 

tcaggccaag 

ctgaggactt 

cttggttctc 

tgcagatttg 

catgttaaga 

gaaacatgga 

tgttttctgc 

cagggagtcc 

ggttgctacc 

cagtgccttc 

caacttacac 

tccaaaggga 

ctggatctat 

tgggaattga 

ctctcctaac 

cattgttcga 

ttgggctgaa 

gggagggccc 

tcacttcctt 

tagaggaagc 

ggcccccaga 

ttgccttata 

cccctgctgg 

acaataatga 

aggatccttg 

gctctctttc 

tcacactcca 

tctgaccctt 

cctggattat 

tatttttaac 

tggtgtcctg 

ccctgcaagc 

ctctgccaca 

gcatctttgc 

gcccaatctt 

gcctcccttg 

tttttctcag 

ctgctccatg 

tatctccagc 

acagaataca 

ggggctcagg 

gctcagtggc 


ttttcatttt 

gagatcaggg 

cttgcttaat 

tgtttagtcc 

ggaaaccgag 

tggcagcctc 

aggtagagtt 

ttttaaaaca 

gactcagccg 

agggctgcag 

cccatggcct 

gctaagcgac 

ttctgttacg 

caaagcaggg 

ccattgaggg 

agtgcagtgg 

ctgcctcagc 

ctgcttttgc 

aaagagactg 

tggcactgag 

ccttctgtcc 

ggctggcctc 

ctgctgtccc 

ctgatgagag 

aaaccttcaa 

cagatatccc 

aagcacccgc 

gctagagaac 

tgccctgaca 

tgggaacgta 

aaatccatca 

accctgacca 

ctccaggtac 

tggtccatcc 

tttccgggtt 

ggcccagttg 

ctcttcactc 

ggtttcttcc 

atgccttcct 

caggcatccc 

aaaggcagcc 

tgaaatacct 

atggcctccc 

attttcctct 

gcgctgctga 

tgacaaaatg 

accatcactc 

cttcatctgc 

tgttcactcc 

cttgtcacct 

tgtggcagcc 

acaggggctg 

tctcatctgt 

tctggcccca 

gccacagtgc 

atgtactagt 

accttctctc 

gctgtcccat 

cgtttatcac 

ggaatgccac 

acctagagca 

gtggacagtc 

gagtttgagc 

cctgaagttg 


ctctgtgcgt 

gtacctcccc 

aataagaata 

tacaacacct 

gtacagggtt 

cagaacctgt 

ggtatattag 

acaaacactt 

gctgattctg 

tcatctgaag 

ctccataggg 

agagaccccg 

ttctattata 

gaataccagg 

tttttttttt 

cacgatctca 

ctcccgagta 

tctttttgtg 

ggagagaagg 

ccttcctcct 

aggacacatg 

agtgcccttg 

caggagggca 

acggtgtgag 

gtgtgcatga 

cgctgagatt 

tttgatgcgt 

gtgggtttct 

tcgaaggcac 

gaggcagcag 

gaccacgcta 

acctgctgct 

ccaccttcat 

agcacctgcc 

ctccaggaag 

gcctcttcgc 

tctctttttg 

actcagcctg 

gagctttgca 

ctaatgcacc 

tatcctgctg 

ggcttgatga 

ctccatgccc 

tcatgttgga 

gggactctcc 

acaaccatta 

cctgagttca 

tcaagccaga 

atcctcagta 

ccactgctgc 

tttagtctcc 

cagtcacctg 

ctcagagtaa 

gcacttcact 

ttcctagtgc 

ccctgtttct 

cttcttcagg 

gtacaattgc 

taacactctg 

ctcctcgatg 

gtgcctggca 

ctgcgaggtc 

cacttgccaa 

gagcatttgc 


ttacatccag 

attttacaga 

gctaatgcag 

ctttgaggta 

gcacagttag 

gttataaggg 

atatttattg 

ctcatctcat 

agtcacagtc 

ggctgactgg 

ctgctcagga 

tatcttctct 

ggcacagagc 

aggcaggatc 

tttttttatt 

gctcactgca 

gctgggattc 

ttcatccaaa 

aaatccagtg 

ctaccctctg 

ggccgcctgt 

attgtgccgg 

tctgcttatg 

agcttcccac 

gaatcactta 

ctgagggccg 

gtgtcattgg- 

atgggtttcc 

atctgaaggg 

ccagtgactt 

ctgcaagatc 

tgtgggtttt 

ccttcccctc 

ttccaggaag 

tgtttctagt 

cctacccctc 

gttttagttt 

tgggagtc:t c 

accgtgatca 

tctctctgag 

taattgactc 

cactttggtt 

accctgaaga 

cagtccccac 

cctgtgtatg 

tcaagtgctc 

gcctcactgc 

accctggagt 

cagccagcag 

ctctccagtc 

cacatctgcc 

tcaggacata 

aagccaaagg 

cccctctagc 

tccggaagtc 

agaacattct 

tctttccctg 

aacctcccta 

cctatctctt 

gcaggaatct 

catagtaggt 

aatgccatcc 

tatcacacat 

cctcaaggct 


cagcttgttg 

tgaggaaact 

agtgctgact 

gatgcgttaa 

gtcattcacc 

tacacgctaa 

ctgtataaca 

acagtttctg 

tctcatgaag 

ggttggagaa 

cacagcacct 

cacataatct 

aaccctgata 

cttgagggct 

gagacagtct 

acctctgcct 

accatggagt 

cagtccctgt 

gcctccctcc 

ctcctaatgg 

cctcacccca 

agagaggaag 

tctagggcag 

ttcccacctt 

gtgggggata 

agatgaggcc 

gtaggggagc 

catagaaaca 

ggaggggcca 

cccgtactcc 

caaaagctca 

ggcatcacct 

tccctgcctc 

cgcagttctg 

agattgggtt 

cttcctccag 

gtgacctttg 

tatgctgcag 

gagaacccca 

accctctgat 

cccctgttgg 

ctatgtctgc 

aatttcccaa 

ttgcatctct 

tgatggagta 

cgttggtgca 

agcggtctcg 

ttccttgatg 

cagcttctac 

ctagccacca 

ctggccccgc 

agtctcttca 

ctttactatg 

tccccctcct 

tcaagtgtgt 

tctccagata 

actgtcctct 

ctgcccgctt 

gcttattgtc 

gttgacttgc 

tctcagctaa 

ctgttattag 

acaggaggtg 

ggggaccaaa 


ctttctccca 

gaggctagga 

gtgcacttgg 

tatcttcatt 

caagatcaca 

agtcttgtta 

aatcacccca 

acagtcagaa 

acatagtcag 

tctatgtcag 

gctttccctt 

cagacgtagc 

tactgtggaa 

gtcttgttgg 

tgctctgtcg 

cccaggttca 

cttgaaccca 

ttatcctaag 

ctgctagcag 

tgagggtccc 

gcctactgac 

cgctggacag 

ggacaccttc 

ccttcccatc 

tttgtccaaa 

tgtgaatctg 

aacactttga 

tggatttctg 

ggccaagaac 

tcagggacgg 

ccaatgccat 

gtctcatcaa 

ccgaggctcc 

gtcttctgat 

ggcgaggggg 

cctccacaca 

tcctgcacac 

tgtgagtctg 

gggaagggtt 

ggcagggagc 

agtcttccct 

tgtttgaaac 

gggcagccat 

ctcctggttt 

acaggacatt 

ggcagcaggc 

gcagagggca 

tttctctccc 

acaccccaaa 

acatctctag 

tgtctcagtc 

catcactctg 

gcctaaaaag 

ccattgttca 

tccctgcttg 

tctgcaaggt 

tctcagtgag 

ctctgcttgg 

tgaccgccac 

ttgatcgtgg 

atgtttgttg 

tggaggaagt 

tgagaaccca 

gagcccatgc 


24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 
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aaagagcccg 

nnnnnnnnnn 

nnnnnnnnnn 

accacaatta 

cctgcaatcc 

agaccagcct 

ggtgtggtgg 

aaatcatgcc 

aaataaataa 

taaaaacacc 

gggcaagcgg 

agccctgtct 

ccagctcctc 

gagccaggac 

caaacaaaca 

aaagcagagc 

tgaatccaaa 

ctctgcccac 

tgggccagtt 

tgatgaccac 

gtgggggaga 

aaagaagttc 

agaagtttag 

cctagagagc 

atctctgaaa 

cattcatgtt 

tgctctaggt 

gggaaaaggc 

cacgtgcaac 

tcaaccccag 

tcttatggaa 

gataattaat 

gaccttggtg 

atgtgtggag 

gcccaccctc 

caaactcctt 

aacggtatgc 

ggtcctttga 

ggtcccctgg 

cactttgtga 

ccaaaggctg 

tgattctgta 

catctgagat 

cagggagacc 

ggatgatgac 

ctcactcggc 

ccttgattcc 

acgtggagcg 

ggtggggaca 

gagacgcagc 

ctctgccccc 

cctcagagag 

cacctctgaa 

ggctcatgcc 

ggagttcaag 

ttagctaggc 

aatcgcttga 

gcctgggtga 

gaaacagtga 

tgtactcaag 

aacccaaggc 

gttgacgtcc 

tggaaataag 

cagagcaggg 


aacgcttaag 
nnnnnnnnnn 
nnnnnnnnnn 
cttttgcacc 
cagcactttg 
ggccaacatg 
cgcgtgcctg 
actgcactcc 
ataaatattt 
catgggaggc 
ggaggatccc 
ctacaaaaaa 
gggaggcaga 
caagacacta 
aacaaaagcg 
ttggagggct 
gttaattccg 
tccagccact 
gagggtgcag 
gaaggccaca 
ctgcctggcc 
tcttctcccc 
agtcagactc 
tatgaaatag 
tttccattat 
catgttcgca 
ttaccgggcc 
aaacccagac 
aactgggggg 
ggcaggtaag 
gaagctgatg 
gtctgctctc 
ccttttcctc 
gtaagatctg 
catgcctgcc 
tggagaca c t 
agtttggtac 
tatttgacag 
cttcgaggta 
ccttggacca 
aataaagcca 
ggtgaagcta 
ctttctggaa 
agctgagaaa 
agagaaagga 
cccacccggt 
tgggaacgta 
caggttccca 
ccgtgaccct 
agaaaag ccc 
tgcctacaca 
acattttgtt 
aaacacatgg 
tgtaatccca 
accagcctgg 
atggtggcat 
acctgggagg 
cagagcgaga 
tcttgtccaa 
ttcacagagc 
tgtgaccatg 
cagcccagtt 
ggagggaaGt 
tcagaaccca 


caccaccctg 

nnnnnnnnnn 

nnnncccact 

cacataaata 

ggaggctgag 

gtgaaacccc 

taatcccagc 

agcctgggcg 

accatgtttt 

tgggcacagt 

ttaaggccag 

taaaaaaatt 

gggagaggtt 

cactccag cc 

acctgtgggt 

caagcgattt 

tctatcatat 

cacgtgtaat 

ttcagaaaga 

ccggcatcgt 

ccagacccca 

ttgtccttgt 

agaagaccca 

tctccacaaa 

cttaaaaaaa 

caaaaatgtc 

taatcgtgtt 

tgctcatgct 

gttggggaga 

ttctcaattg 

caccatctgt 

ccctcaggct 

cctaggggca 

gctcttcaaa 

caccagaaca 

gagcaaaagt 

tgcccaccgt 

aggaaattga 

caccaggttg 

aaccacccat 

gagagtcagg 

aaccactgaa 

gtcacgtgag 

gcttgctgtg 

gagcggagaa 

tcttgtgtca 

atggcagaag 

aacatgttgg 

tgcccagcat 

caaggtaaaa 

gtcacagtg c 

aattttgttt 

aataggtgct 

gcactttggg 

acaacatggt 

gcgcctgtaa 

tggaggttgc 

ctatgtcccc 

cccatttgag 

acattaatgg 

gctgtgtccc 

tctgcctaat 

gagccagggc 

caccaggaga 


cctggccagc 

nnnnnnnnnn 

gcgcctggcc 

gttaccatgg 

ccaggcggat 

gtctccacta 

tattcaggag 

acagaatgag 

gaccacctgt 

ggctcacgcc 

gagttcaaaa 

aactgggcat 

cgcttgagcc 

tgagtgacag 

aggtaggaac 

gccaagcaga 

caccacagcc 

tgacttcctc 

taaggcaggc 

agatcagctg 

ccaaggtaga 

gcccaatttg 

tcattccttt 

gcctgaagtt 

tacatacatt 

catttcatag 

tatcctgccc- 

cagccttttc 

gggaggtgca 

aatgagattc 

cctcttgtcc 

cctgctcaaa 

ggacagagct 

cactacttca 

actgatggcc 

accttcttta 

ggaggttgag 

ggcccccatc 

acctgtccag 

cctctctgga 

gaccttgaac 

ggtttttcag 

tgacagagta 

gcccaggaga 

gtcagacccg 

agacaaaaag 

aggcgtaaga 

gcagctgagg 

tcccagccct 

caaacagaaa 

cctttagctt 

aattccaggt 

taagaaacac 

aagccgaagc 

gaaaccccat 

tcccacctac 

agtgagccga 

accccccaaa 

atgagacaat 

ctttctcccc 

ggtaataggc 

caggacaaat 

agacagtctc 

gaatttaatt 


ggggnnnnnn 

nnnnnnnnnn 

cattactttt 

gctgagcatg 

cacttgaggc 

aaaaatacaa 

gcagaggttg 

actctgtctc 

tatgtgccaa 

tgtaatcgga 

ccagcctagg 

ggtggtgtgt 

cagcagtttt 

agcaagacac 

aggctcatag 

ggtccaagcc 

ctctctgccc 

aggggcagga 

caggccagac 

tgagaggagg 

tcccaggcct 

cagatgagga 

ttctttttca 

gctggccact 

tttgcctatg 

tacgtacaaa 

cttcctggca 

tcacctttcc 

agtgctctgc 

tgtgcaaatg 

tccccatacc 

cctttttctc 

tcaaagggcc 

gttgaaaaga 

cccccaccca 

gtactctttg 

gagcatggca 

ttgcactgag 

gatccagcct 

aggtgtggaa 

gcatgtgaag 

cagtgtgtga 

cagagaaaaa 

gagggggaag 

tgggtcagca 

aaaacccagg 

gccaatcatg 

agggaaagca 

gctccataca 

aatcaatgtg 

caaaaaggct 

ttcccaagtt 

tgatcttggc 

tggtgggaag 

ctccaccaaa 

tccagaggct 

gatcgcacca 

aaaaaagaaa 

tgagacccag 

attgtcgttg 

agtgcctctt 

cacatcctgg 

cagaggaggt 

gatcatgtgt 


nnnnnnnnnn 

nnnnnnnnnn 

aatggcaaaa 

gtggctcagg 

caggagttca 

aaattagctg 

cagttcactg 

aaaaataaat 

ctgtattact 

cactttggaa 

taacacagta 

gcctgtaacc 

aggttgcagt 

tgcctctaaa 

tacagatgag 

gaggtctctc 

cagggagagt 

aaggcttcga 

caggtgaaca 

gggcaggccc 

cagaggcctt 

aaccaagacc 

cttgaggccc 

ggctcaaaat 

actccacaaa 

ggaaacttag 

cattccccag 

caggtcctcc 

ccaagggctc 

tgtcagccct 

atctgaccag 

tgcagtcttg 

acacccccaa 

agggagaact 

tgcgctctct 

taaagtgcaa 

tggctcaaag 

ctaaaacttt 

ggcataaact 

aaatgtggcc 

gggctggact 

gccagttccc 

gaatcagagg 

gcctgcattg 

ctagctgctg 

tggcctcata 

agggccatta 

gcacccatga 

atagctccag 

ggaaactgta 

cccagacacc 

tgttacgtaa 

tgggcgcagt 

cttgaggtca 

aatacaaaaa 

gaggcaggag 

ctgcacttta 

agaaaagaaa 

ggaggaaaag 

tcccagccct 

aaccctctcg 

gaggtgaggg 

ggctctgacg 

tccactcacc 


27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 

29220 

29280 

29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 

31320 

31380 

31440 

31500 

31560 

31620 
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tgcctcagcc 

ctcctctggc 

gatccgtaca 

ctcaacagct 

cagcactttg 

gagcaacacg 

cgtgcacctg 

aggcagaggt 

agactccgtc 

ccttccccct 

ttttcctgca 

gcctggagag 

taccgtgggt 

ggcaacagca 

agagcagcgg 

gccagcgccg 

agccgctgga 

agggatttag 

gggagaaagg 

ggcggctgct 

gagataaaca 

taacactcct 

cacacataca 

aatttcatat 

agggacgggg 

gctgctccct 

cgactgacat 

aatgtaacct 

ctagaacaga 

acaccccgac 

cctgcacagt 

cagcccactc 

tgctgccagg 

ccaaagccat 

gcagcagatg 

tccgccttca 

tccacgtccc 

agacacaggg 

acggtccctg 

ggtcgggggc 

aaggaaccaa 

ctgcgctgct 

gctcagcaca 

cctgctttca 

atggggctgc 

cacagcctga 

tgtaatatgc 

catatctaga 

attaaaatgg 

caggcagatc 

tctctaaata 

taaatgatag 

cactgttata 

aatgagaacc 

taacggtaaa 

acaaagggct 

nnnnnnnnnn 

nnnnnnnnnn 

ttccttcctt 

gcacttggtg 

ggctgtcaaa 

ataaaataag 

tctgctgaat 

ttatgaagca 


aagccctcag 
cccaccatcc 
aagcctaaac 
ctggtaaaaa 
ggaggccgag 
gtgaaacccc 
tagtcccagc 
tgcagtgagc 
tcaacaacaa 
tcccctgaag 
ccctgcatcc 
gggcagggcc 
aaggcgaggc 
gctgggcccg 
cccagggggc 
gaccctgcca 
caagccccac 
ggccctgggc 
gatccgcttc 
ggggagaggt 
gtccgagcca 
aagcctttaa 
ccactcacca 
ggttcaacct 
tcacaca ccc 
ccccctcaag 
aatttattag 
ccagcaggga 
gctccatgga 
acacagcctc 
ttctcacaca 
atactgccct 
ccaggcctgt 
tcagggacag 
agggcatttg 
tcccacctat 
gggaggccag 
aagggcgagg 
tcctcccgtt 
caaagcggtc 
tggtgaagcc 
catcctgggt 
tgacaccagg 
aatcccatgc 
tccagaaatg 
ggcttcaact 
actcagtgtc 
ggcaaaggca 
cccggcgtgg 
gcttaagcct 
aaaaaaagaa 
tctgataaaa 
agtcagttaa 
aggcactgcc 
ctgccaactt 
gcttcgcttt 
nnnnnnnnnn 
nntccctccc 
ccctccatct 
aaagtttcaa 
ggagacttac 
ggccttttgc 
cagatagcta 
ctccaaagtt 


ggcaggggaa 

tccccaagtc 

acactccaca 

aggcgtttag 

gc:gggtggat 

gtctccacta 

tactcgggag 

cgagatcgcg 

aaaaaaaaaa 

tggctggctc 

gtgaggcacc 

ccctcccctc 

cggccggcta 

caagactcag 

ggggccggcc 

ttggctggag 

ccggccgcca 

caagttgcac 

cttcctttag 

gaaattcagc 

gcccggccag 

cgcgtttaca 

catgtaatag 

agtacactca 

actctcccac 

atcatgttac 

tttacttgtt 

ggatgactcg 

cattcatggg 

atccacacac 

ctcacttgat 

caccccactc 

gacactcacc 

cgccaggaat 

gggagctgtg 

tcttttcttg 

ggccatgagg 

ggttggcctg 

ccccatagcc 

catgaggatc 

caggttgagc 

ggtgttggtg 

aacagctggg 

caagtgcctt 

gcagccatta 

agctcaaatg 

caagcttagg 

aaggcagtga 

tagttcatgc 

acaagtttga 

atttagaaag 

taatagctaa 

taaagtatcc 

ctccagtccc 

ggagttgtga 

actcaatgct 

nnnnnnnnnn 

tccctccctc 

tctccacacc 

ttctcctgtg 

ccaatctttc 

atctgattta 

aggggg ccc t 

tgagacacgc 


ggcaaagtca 

actagatccc 

gagggggaaa 

gccgggcgca 

tgcctgagct 

aaatacgaaa 

gctgaggcag 

ccactgcact 

atggtgttta 

aggaaaaacc 

actgccaagg 

caaggggcca 

accccgggct 

cacgggacgt 

agagggagcg 

attacaggag 

gggagggtct 

agcagggaga 

agctgtgaaa 

caaaaccacc 

gaaccttccc 

cactcacata 

gtcaagccat 

caaacacacc 

gacatggcaa 

ccagttttat 

tattgggtta 

gtcagtcctg 

ctctgtacac 

acacagcctc 

ctagtgatct 

actctgccct 

gtgagtgaag 

atcaacggag 

ggaagccaag 

tggggccaca 

gtgcaggacg 

tgagcacccc 

cagccacctc 

cccagtggca 

atctcgtcct 

ctgctctcag 

cacaggagac 

tgggggtacc 

gtacctgacc 

aaatactgga 

gggttgtgga 

gtggtactct 

ctgtaatccc 

gaccagcctg 

aacactaaaa 

tacttattga 

cgttccctag 

ctctagaagt 

caagttaagg 

cannnnnnnn 

nnnnnnnnnn 

cctcccttcc 

tggtatcatc 

tggagaggtg 

cagcccacca 

agtaggaagc 

ggaatctgca 

ctcaaaggtg 


ggatgccctt 

acagctgaga 

ctgagactct 

gtggctcaca 

caggagttcg 

aaattagccg 

gagaattgct 

ccagcctggg 

aacacatata 

tctacccact 

acgccaggga 

caaacgctgt 

ggcggccttg 

cctcgtccaa 

gggagaggct 

gcggggacat 

gaggtcaaga 

aggggctgcg 

tgtccccggt 

cagtcaggca 

ctccaacctc 

aataaacaca 

gtgcacgacg 

taccaactca 

gcgtgcacac 

tttcttccca 

tctgtgcccc 

attgtgctgt 

acaaacacac 

acacctgctc 

gcgtccacag 

caccccactc 

tgaacgttag 

acagagctgg 

ggcgggagct 

agaggacaga 

cagtgaagca 

ccctcccctc 

acctgccaac 

gggtggtggc 

gctggtcaca 

ctgaggaggg 

agcagcccac 

ctagagtcac 

ctgggagagt 

cataaaagta 

cccccaacaa 

aatggctata 

accactgtgg 

ggcaacatgg 

cttagaggaa 

gcacttaact 

gtgatgaagc 

ccacttggag 

agaaaagcta 

nnnnnnnnnn 

nnnnnnnnnn 

ttccttcctt 

atacagaagc 

agcactgagg 

atcccttgcc 

tgattcctga 

ttttagcaag 

gagtggttct 


cgcacacacc 
aggaccttag 
gaagggaggc 
cctgtaatcc 
cgaccagcct 
ggcgtggagg 
tgaacctagg 
cgacactgcg 
actaaattat 
caggcagagg 
aggctgccag 
ctgcgcccag 
cagcgtgcgt 
gtctgggcca 
gaggggcggt 
agcagggagg 
gccggagaga 
cagaggggcg 
tggaattaaa 
gcccttctca 
cctaagcctt 
ctttgagcaa 
aggtgtcgac 
tggctttcac 
gctatctcaa 
gcacctatga 
tcacccccaa 
agtccaggac 
acattaacat 
tttgcagcca 
gcccct cccc 
gggggaactc 
gcagatgcca 
aaaggggaaa 
ggggtaaaca 
caactcacct 
ggcactgtgg 
cccctgcagc 
cagccgcacg 
gctgagcacg 
gcctggccac 
ggaagggagg 
agtcaggcgg 
atctcctctg 
cttgtgcaca 
tttactaagt 
gaagtgcccc 
acaagaattc 
gaggctgaga 
taaaacccca 
gctttcccga 
atgctccagg 
tgaggcacag 
gacttgtcct 
gtgataggag 
nnnnnnnnnn 
nnnnnnnnnn 
ccttccttcc 
agagaggact 
aaggggtggg 
cagtgtttct 
gcccctcaga 
cggaggtggg 
gtggggggca 


31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 
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gaaaggaaaa 

tggaaagggt 

caagccagac 

cgaagcagac 

cgtggagacc 

ctcatccccg 

gtccatcact 

ttctacccca 

cctttgctat 

ctcgcccctg 

gaccccgcca 

ggattgcgcg 

cgcatatatg 

aggcttcgaa 

cgcaccctct 

ccagctcgct 

cagacgccct 

ggccagaggc 

ggctgcagat 

cgggcagagc 

accctcggag 

cgcacgtggc 

actattcgca 

agccagccga 

aaaaagtcag 

ctttgccaag 

tctctcttat 

aatgctgtct 

actcttggcc 

gctatccttt 

cccttgggtg 

caacaccaac 

atcctaaacc 

tgtcccagag 

ctcttgtcct 

ttatttattt 

aaggaagctc 

actttcggca 

tctaaagcat 

aaatcctggg 

gttgggaaca 

ctagttaatg 

caccaaccgc 

taagacttgc 

gacccccaac 

a ggg c tgtgt 

gcacatggga 

accgctagaa 

aacccctcta 

gaaggttttt 

taaatctacg 

cttttggttg 

gcggtcacct 

tgctggagcc 

aatagcagga 

agtcatctgc 

agccagaata 

ttaatgttca 

agggcctggt 

cactgtagga 

tctgaccttt 

agacacagtt 

gacacgcggg 

gcaaataatg 


tgcaaagggg 

gatgccaggt 

cggggaggag 

acttgcaggg 

ctggagagcc 

ggcaccagcg 

cccccggcca 

cagtccccag 

ttgcgatccc 

ggagggcggc 

ccccgtcccg 

gcagggtctg 

cccccctttc 

attcctcgtt 

cccctcccca 

tccgcccctg 

tctcctagga 

aagcttcgag 

aacgcttgca 

aagccccggg 

ccagccgcag 

gcttaccccc 

aattgaggat 

acccgctggt 

tgttggaagt 

tctccagacc 

taacttctca 

tacagagcca 

ccaggagcag 

ccacagactg 

tgtgccttaa 

ctcttaggct 

aatgttagtt 

gcacaagcag 

cactgatggt 

actttaaagc 

aaaatttgca 

ggccctaggg 

tctgtcctaa 

aagtaggggg 

gggaggtgag 

cccagctgag 

caaagacagg 

aagcagcagc 

ccctaaccca 

ctctctcaca 

cctcaggggt 

ggagataaga 

ggatagggac 

tttcctcctt 

gccatggctc 

ctcaggcctt 

cagagccgct 

gggcacggta 

ggcccagatt 

ccttagcctc 

aatacagcta 

gcaattctgc 

gggtgcccac 

gtctgggctg 

ggagatgtta 

caaggaactg 

ctctgactcg 

gcaattctta 


gaaggggtca 

gtggggagta 

ggggccacag 

tgcaccccgc 

cccaaccttg 

gccttcccgg 

gagcccacca 

tcctagctca 

agaactcgtt 

cccattaacc 

cggcgggggt 

ccacagggca 

tgggaaaaga 

ccctatcctc 

gccatctgtt 

cccagccccc 

ggccggaatt 

accccccacc 

aggacgggag 

aagaggcagg 

ccacgcagcg 

acccccgggt 

cccggacaca 

gggtgctagc; 

cgggagtctg 

cctgaggaca 

agagaaacaa 

acctttggag 

ctgagaaccg 

ccgaggttcc 

cctctctggg 

atagtttgga 

ttcccttcat 

gtgcagggct 

gtcttctctt 

ctcactttaa 

taaagtttta 

atgctatggt 

tcctctgtat 

caggagctgg 

ccagacagcc 

cctgggtgaa 

cgcacaccag 

accagagagg 

ggacacagct 

ttcacacata 

agcctgtttg 

ggcaccctgg 

tgtcttcagt 

tttgcagtct 

tatgtgcatg 

gattgcctgt 

aagcaccttc 

atagaagagg 

cacctttagg 

ccacagggag 

gtacttatta 

aaagtggctg 

accatatggc 

gtcaggctgc 

accaatggga 

ggatgcacag 

gttcacatcc 

ctgagtgcct 


cacttgggga 

acagatagag 

ccaaggtgag 

cctctcttcg 

tttctggggg 

gaggctcaac 

acgctcctcg 

tctgcataaa 

ccccaccccg 

ctcgcgaccc 

ctgggggtga 

gaggccaggg 

cggggagggg 

cggcccccgc 

ccactccgca 

tccccaagcc 

tctgcctcca 

aaccaccacc 

tcggggaggg 

ggttttccct 

ccgcctgccg 

ccgctcctgg 

gagtgcagag 

caattctgat 

ggctcagagc 

ggttttccta 

agacaaaggg- 

gtgggggaga 

gaaagaagct 

aaattgagct 

cttgtttcct 

taaaatgaga 

ttggggactt 

ggataaataa 

gatatagata 

tgttaaaggt 

agataaaata 

gggaagtttg 

ggagaaaagc 

actccctcca 

agaggcgaac 

gaaggatggg 

ccagtctctc 

gaacctgccc 

ggcacctcag 

cacagacaca 

ccgatccccc 

tctcctccaa 

caatggagcg 

ttacaaaaat 

ttacaggtag 

catccaggtc 

agtgggccca 

taggaaggca 

gcaaggagag 

gagaaggcgg 

tgtgtagtca 

agatgagact 

actcactagg 

tcccgaaatg 

tcccgttcag 

cctggtggac 

cactctgcat 

ccttctcagg 


aggtttcaga 

gaggcaaagt 

acaggtcagc 

tggcaatctg 

gtgggtcaga 

acgcagatac 

aggtccgacc 

gctccaatta 

agcccgtttc 

gggccgctcc 

ggggcgcgcc 

ctctccggga 

ggcttctcct 

acccctcctc 

gcgccgcgac 

ccggggagtg 

tctcccaccg 

accgttgcga 

tgtagggcga 

cccgggtcgc 

ggcacaccaa 

ctcgcgctca 

accccggcaa 

tttgtacttt 

agcagggatc 

tctgaaaacg 

agggaaaatg 

tggccaaggc 

tgggacctcc 

ccaccaccta 

acagcgacaa 

tagctgtgta 

gctctaacct 

ggtatgtctt 

attttaaagc 

aaatgtaaat 

ggagactcca 

agtcatacct 

cagcttcctg 

agcactaagg 

gggctggcat 

ggtgtgggga 

acttcccttt 

tcctggccct 

gcccctttcc 

tgcatgtgtg 

caagaggtac 

cccaaggagg 

ttgacttagg 

agaacttctc 

aaaagccata 

ccttggtctg 

tcccattggc 

acaggaatcc 

agaaacagag 

ccatttttct 

ttgttccacc 

tctcaggtat 

taggtatgag 

gggccttctg 

ggtggcgaga 

agaaggcttg 

tactcactgt 

gctgttgtgg 


caataccgag 

gagtggagac 

agccagaaac 

agaccgagga 

gaggaagcct 

ctggataggc 

ttgtccctcc 

acatgttttt 

ccgccgcttc 

tggcggtcct 

ctggggcaga 

aaaaggcagg 

gggagactcc 

ctccccgcca 

aaacacggct 

ggggagtgag 

gggtccggct 

gggccggtga 

gtttaaagga 

cgccccccgc 

ggacctggcg 

gcctccccag 

gcctactgaa 

acaaaaacaa 

tgcgatgtga 

gaggggacag 

gcttagctgg 

ctctgaggtc 

tttctgcaga 

acactgtgtg 

gaaagaatga 

gaacagacag 

ccagggctta 

tctgcaggat 

ttcacgttat 

atagtataac 

aaaaagtgtt 

tagcattctt 

gatgtacccc 

gcagggcatg 

gccaagcgtc 

agacaccccc 

ttatttcctc 

ggaaggggcc 

ttctgaaagg 

cacactcatg 

caggaggcag 

aagaaagctc 

gggcgttttt 

ttggtattta 

tggggcactc 

agaagtctat 

ggcgtactcc 

caggagtgag 

tcaagtaggt 

ccaggtcctg 

agtatctcac 

aacaagtggc 

gaaggcacag 

ggctcacccc 

ggaggctctc 

gaaggcccag 

gtgactttgg 

cgaagatgta 


35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 

36000 

36060 

36120 

36180 

36240 

36300 

36360 

36420 

36480 

36540 

36600 

36660 

36720 

36780 

36840 

36900 

36960 

37020 

37080 

37140 

37200 

37260 

37320 

37380 

37440 

37500 

37560 

37620 

37680 

37740 

37800 

37860 

37920 

37980 

38040 

38100 

38160 

38220 

38280 

38340 

38400 

38460 

38520 

38580 

38640 

38700 

38760 

38820 

38880 

38940 

39000 

39060 

39120 

39180 

39240 

39300 
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agttaaaaaa 

ttatttcatc 

gaaagatcag 

gcagggctgc 

aagggtcccc 

ccaatggtgc 

gtttactaaa 

actctgggag 

aacatggcaa 

gtagccatca 

caggaatcag 

aacaagacac 

atgtaatccc 

ccatcctggc 

ttagctgggc 

aatggcatga 

gcctgggcaa 

aaaataacaa 

tcaggggact 

cttggaaacc 

gcaccacata 

aaccctcgct 

tggctgctga 

aaaaggggtt 

tcagcccagg 

nnnnnnnnnn 

nnnnnnnnnn 

aagaaaaaaa 

tcaatgtttg 

aaaagtttat 

aaaatatctt 

ttttattttt 

tcttggctca 

aagtagctgg 

ttgaaatgga 

tgcaacctcc 

attacaggca 

tcaactaaag 

cccacctcgg 

tacagtgttt 

actactcact 

agtgccctat 

tctatgatta 

agtactatgt 

atactgtcta 

atttctctga 

ctcaaatttt 

ggcgcaatct 

gcctcctgag 

ttttattaag 

ggtgatctgc 

cagccattac 

tgtcatcaag 

gctcaaggga 

atgctcagct 

ctcaggtttg 

ctgggattac 

ccacttatgt 

ttgcttaagg 

tatgtgtgtg 

cacaaacatg 

gagtacttgt 

atgccctgaa 

ttcctgcaca 


aagtatgcat 
acaattcttt 
cgtcagggag 
ctcgtgcagg 
tttcaggggt 
ctgaacacag 
aacagact cc 
gccaaggtga 
gaccctgttt 
cctgtggttg 
aggctgcagt 
cttctctaca 
agcactttgg 
tagcacggtg 
gtggtggtgg 
acccgggagg 
cagaatgaga 
taaattaaaa 
ggggaatctg 
tgcagattag 
caaggcccta 
ctcacccaga 
agcaaggcga 
ggatgggaac 
caagaacttt 
nnnnnnnnnn 
nnnnnnnnnn 
aatcttaaag 
tatttcaagc 
aaagtaaaaa 
tgtataaact 
ttgaaatgga 
cggcaagctc 
gattataggc 
attttgctct 
acctcccagg 
tgcaccacca 
agaaccatgt 
cctcccaaag 
ataaagcctc 
cactcactca 
ataggtgaac 
gctacacaaa 
atgggtttgt 
gtttgtacac 
gtgtattcct 
tttttctttt 
c 99 c gcactg 
tagttaggac 
agatggggtt 
ccgccttggc 
gtttttttgg 
gctggagtgc 
ccctcctgcc 
aattattttt 
tctcaaactc 
aggtgtgagc 
tttcaagtta 
gatctgcagg 
tttatgtacg 
taattataag 
gccagtctgg 
gtacggtagg 
agctcagctg 


catgcttagc 

gggaggaggg 

gcagatgagt 

agtgggtgcc 

gctggagaca 

gcccaagagt 

taggaggtca 

gaagatcatt 

atctgtacaa 

cagctactca 

gagctgtgat 

aaaaaaaaaa 

gaggctgagg 

aaaccccgtc 

gcgcctgtgg 

cggagcttgc 

ctccgtctca 

acaaaaacag 

tatttttaac 

agacctctga 

cccctgcccc 

agggctctcc 

acacagcact 

ctgtccccaa 

ttcttttcct 

nnnnnnnnnn 

nncactccag 

aataaggata 

caagtgttat 

agctaagtaa 

tagtgtagtc 

gtttcactct 

tgtctcctgg 

acccgccacc 

ttgacccagg 

ttcaagagat 

cactcggcta 

tggccaggct 

tgctgggatt 

cagtagtgta 

cccagagcaa 

cattttttaa 

tgcttaccat 

agcctaggct 

tctatcatgt 

gttgttaagc 

gatacagggt 

cgacatctac 

tacaggcacg 

ttgccatgtt 

ctcccaacat 

ttgtttaatt 

aatggcacaa 

tcagccttcc 

tatcttttat 

c tgggctcaa 

ctgaaaacct 

agcaaggttt 

cctgacctcg 

tgcatgatgg 

cagggataaa 

tgatcaggaa 

agttggttag 

ggcactgagg 


acatagtgag 

tttacgtgtg 

tcaatgtaag 

ctaccactga 

ggatcccaca 

caggactggc 

ggcactgtgg 

tgaggccagg 

aatttttttt 

gaaggctggg 

tttaccaccg 

aacaataggg 

agggcagatG 

tctactaaaa 

tcccagctac 

agtgagccga 

aaaaataaaa 

actcctacgg 

actccgtgag 

ggtgcctctg 

caggagagtg 

tttcaggggt 

ggtgagggac 

aacgggagat 

tcccaacatg 

nnnnnnnnnn 

cttgggtgac- 

taaagaaaga 

tacaaaacag 

gctagggtta 

taagtgtaca 

tgttgcccag 

gttcaagcga 

atgcatggct 

ctggagtgca 

tctcctgcct 

atttttgtat 

ggtctagaat 

gcaggcatga 

cagcaatgtc 

ctgccagtcc 

tattttatac 

tgtgttacaa 

ataccatgtt 

ttgcataaag 

aacacatgta 

cttgctttgt 

ctccccggtt 

caccaccacg 

ggctaggctg 

gctgggatta 

tttttttttt 

ccatagctca 

cagtaactga 

tttttgtaga 

tcaattctcc 

tctagtgtgg 

aggtcactta 

gttttcattt 

ggggaaagct 

ttctatgatg 

caatgtcctt 

gggtgaggca 

acccagtgga 


tgcttggtaa 

ggtggcccca 

gaaaagactt 

gggtatctaa 

ctaggtagaa 

cacttcacaa 

ctcacgcctg 

agtttaagac 

taaaaaatta 

gccggaggat 

cactccagac 

ccgggcgcgg 

acgaggtcgg 

atccaaaaaa 

ttgagaggct 

gatcgcacca 

ataaaaataa 

tcaggctgag 

gggttctaaa 

gctgagatga 

gctcctgctc 

cccaccatcc 

tgcaggcctg 

caaagggtgg 

ggnnnnnnnn 

nnnnnnnnnn 

agagtgaaac 

aaatattttt 

tcaaaagttt 

atttttttat 

ttgtttttat 

gctggagtgc 

ttctcctgcc 

agtttctttg 

atggtgcaat 

cagcctcctg 

ttttagtaga 

tcctgacctc 

gccaccatgc 

ctagaccttc 

tgcaagctgc 

tatattttta 

ctgcctacag 

gcctacgtgt 

ataaaatcac 

taaacattta 

cacccaggct 

caatcgattc 

cctggctaat 

gtctcgaact 

caggcatgag 

taagagacag 

ctgcagcctc 

gactacaggt 

gggggggtct 

tgctttggcc 

aagtggaaga 

tgaagcctga 

gttttaacag 

cagaaatcaa 

aagaagtatg 

tgggaagtga 

gtaagaccca 

ctctgctaca 


atagaagcag 

cagggcagat 

actaacagca 

gctaagaggg 

ctggattgga 

agcacctgga 

taaccccagc 

tagcctgtgc 

gccaggtatg 

cgcttgagcc 

tgggcaacag 

tggctaaggc 

gagatcgagg 

aaaaaaaaaa 

gaggcaggag 

ctgcactcca 

ataaataaat 

atatcctgat 

aggcagacaa 

gtgagggatg 

cccccacacc 

ccatgaaaag 

tcagcgtccc 

tgggggcctt 

nnnnnnnnnn 

nnnnnnnnnn 

cctgtctcaa 

gtgtagctgt 

ttaaaaattt 

cgaacaaaga 

tttatttatt 

aatggcatga 

tcagcctccc 

catttttttt 

ctgggctaaa 

agtagctggg 

gacagggttc 

aggtgatcca 

ccagccagta 

acattcactt 

atgcatgata 

ctgcaccttt 

taatcagtac 

gtagtcgtct 

ctaatgacac 

caagaaatag 

ggagtgcagt 

tccggcctta 

ttttttgtat 

cctgacctca 

ccaccatgcc 

attctcactc 

caactcctgg 

gtgagccacc 

ttctatgttg 

tcccaaaggg 

taggcccagg 

ctagttttgt 

tgtctatgtg 

gtaagccaaa 

ggccacggga 

catttgagcc 

gagctggggc 

gggcagtgag 


39360 

39420 

39480 

39540 

39600 

39660 

39720 

39780 

39840 

39900 

39960 

40020 

40080 

40140 

40200 

40260 

40320 

40380 

40440 

40500 

40560 

40620 

40680 

40740 

40800 

40860 

40920 

40980 

41040 

41100 

41160 

41220 

41280 

41340 

41400 

41460 

41520 

41580 

41640 

41700 

41760 

41820 

41880 

41940 

42000 

42060 

42120 

42180 

42240 

42300 

42360 

42420 

42480 

42540 

42600 

42660 

42720 

42780 

42840 

42900 

42960 

43020 

43080 

43140 
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gagcagaaag 

agaggctgat 

gcaagactcc 

gcagtctcag 

gcagtgcgcc 

caaaactgaa 

ggctgaggtg 

aaacccgata 

gaggctgagg 

gcaccactgc 

aataacaatt 

ggtcacttta 

atacgacagt 

ctgccctgcc 

ttctcagtga 

atggaatgca 

atgttcatgg 

tggggctcag 

ggaggtttat 

gcaagcctca 

gtctgtggtc 

catgaagcgt 

accaatactg 

ttgagaaatc 

tggctcacgc 

tgagtttgag 

agccaggcgt 

tcacatgaac 

ctcttggcaa 

gtagactttg 

acacctccag 

gtgtactttt 

tcagtccctc 

cactcaatac 

gagcctcatt 

tctccgagtt 

tgggtggctt 

aatatggctt 

gaggtgcaca 

gggaggcctg 

gctgcagccc 

tctgtctctg 

ctcttccaaa 

ctgcgccact 

gcctcttagg 

ctagttactc 

aggccaagcc 

gtgggggtgg 

gcaaactcta 

acaggaatca 

gaggggtata 

ggcacatctg 

tcttgctctg 

cctcctgggt 

acaccaccac 

ccaggcttgt 

tggtattata 

gggataatgt 

ataagttttc 

caagattcag 

actaggagag 

gctccaattc 

ttcaagttct 

ctgtttgatg 


gctgaggaag 

gggggaaaat 

atccctgtaa 

ctactcaaga 

acgattgtgc 

tgaataggct 

ggtggatcac 

caaaaattaa 

catgagaatg 

actccagcct 

aaaaaaaaat 

gtaaagggag 

catggaaatg 

ttggccactt 

gtagctctca 

taaaagatgg 

agccctggga 

aggaggaagg 

ggagaaaggg 

actctgtaga 

ttggggacat 

atctcttcgc 

aagatcttta 

aggacaagtg 

ctgtaatccc 

actagcctgg 

ggtggtgcat 

ccaggaggcg 

cagagccaga 

tgtttttctc 

tcttgcctca 

tgattgtcct 

cccatcccga 

atcttgcatc 

atagccgttc 

tcgcttgtca 

gctttatctg 

aatggtggac 

a gttagaggg 

tccagctgct 

aggtgttctg 

acttgtcaac 

agccttgggc 

cacctgctct 

tatcacttcc 

cttggaggta 

cagcctaaag 

caatggagtt 

gagaaccgtg 

gcatcccaca 

gggaggagcc 

tgttttcttt 

tcgcccaggc 

tcaagtgatt 

acccagctga 

cttgaaatcc 

ggtgtgagcc 

gcattttgtg 

ccccagggag 

cctctcaaaa 

ggcaagtgag 

ctagtgatag 

gagattgtta 

acgtcccaca 


gctgggtgtg 

cggtagagct 

aaagctttta 

ggctggggta 

cactgtactc 

gtgtgcggtg 

ctgtgattgg 

ctgggcatgg 

tcttgaaccc 

gggagacagc 

taaaaggcca 

aacaatggct 

ctagggccca 

cctggccctc 

tttagtgcca 

ggctgtgata 

gggagctcag 

acttgttcaa 

tgaccatcca 

cttggtgggg 

tgtcgctccc 

cgtcccccat 

gtaaagttct 

agtcagggcc 

agcactttgg 

ccaacatggt 

gcctgtaatc 

gaggttgcag 

ctacctctca 

aaaagcactg 

gggtcagttt 

aaatccagag 

atcccaggga 

ctcgctggac 

gttcctgttg 

gcatttctcc 

aaattttcag 

cctgtcaggg 

ttagacaagt 

ccacggttga 

gctgccctag 

taatggcact 

ctctgactgc 

ggcgctaggc 

cctgatccca 

tctgcttctc 

gacgcttcct 

ggggggggct 

taaataggag 

gcccatgttc 

agcagggtct 

tattttattt 

tggagtgcag 

ctcctgcctc 

tttttgcaat 

tgacctcaag 

actgcgctcg 

gaagcttggg 

gacactgttc 

tggagacagc 

tggtgtttgg 

gaaaccatta 

acacacctta 

ctgtgggtac 


gtggctcaca 

caggagtttg 

aaaattagct 

aggattgctt 

caacctagga 

gctcactcct 

gagtttgaga 

tggctcacat 

gggggg c aga 

gagactccat 

gggagcactg 

cctcccagga 

ggcagaccat 

tgcatgcccc 

gggctctcgg 

gcccagagct 

tgcaagttca 

agacacacag 

aggcttggac 

gccaggccct 

cttcctgctg 

ccttgctgag 

cttttttttc 

aggacaaaaa 

gaggccaagg 

gaaacctcgt 

ccagctattc- 

cgagctgaaa 

aaacaaaaac 

tcaagccagt 

ccagcctccc 

tctgtggcct 

gccgcaggtg 

ccaatccatt 

acctttccag 

aatcccatca 

atttgacttc 

gtagagaaaa 

ccagccacaa 

gggtggagca 

ctgcctggct 

atgagattgc 

aacatggagt 

gtgtgcctaa 

aatacttacc 

accggggact 

acatgacttc 

cagggagggg 

tgattattct 

agctatgaag 

tgagttcata 

tgaatttaat 

tggcgcggtc 

agcctcccga 

tttagtagag 

tgatctgcta 

gccacatctg 

ccgtttgtgt 

cacttaggga 

agttccaggc 

gatgtgggga 

agctacttgg 

caacaccgcc 

ctttatgaac 


cttgtaatcc 

agaccagcct 

gggtgtggtg 

gagcctagga 

gacaaagcga 

gtaatcccag 

ccagcctggc 

ctgtaattzcc 

gggtgcagtg 

ctcaaaaaaa 

gcagcctgtc 

cctctgggat 

ctcagggaaa 

agggtctcag 

gcttacatcc 

aggggtttga 

tttctctttt 

ggagtgtttc 

aaagatcatg 

cccaaacaca 

atgctctgct 

agaggatggg 

attttctgaa 

acagtgtggg 

tggcggatca 

ctctacaaaa 

gggaggctga 

ttgggccact 

aaaaacaaac 

gcccgcagca 

tggacacttc 

gacctggttt 

tgtgcagaag 

ggcttggtga 

atcaatctgc 

tgtactttgg 

aggtctctcc 

tattgaggag 

ccagcccaag 

tacaggaagg 

ttggtagaag 

acataattaa 

ctgggtatca 

tcacttaatt 

aggtgtggga 

ccgaaaccaa 

aggcttgcgg 

atgtggaagt 

gtcccttccc 

aatggaaact 

ttagtgccct 

tttttttttt 

tcagttcact 

gtagctggga 

acagggtttc 

gcctcggcct 

tgttttaaat 

ctaggactct 

gtcaggaccc 

ctgggctggg 

agtattatga 

catcttaaaa 

gccgttatta 

aggaatttgc 


cagggctttg 

gagcaacata 

gtatgcatct 

ggtggacgct 

gatcctgtct 

cacttttgga 

caatatggtg 

agctactcgg 

agctgagatc 

aaataataat 

caaggtttca 

ctcagcattg 

acaagtggct 

caccaagctg 

tacgatgacg 

atctcatgag 

ttggttgaga 

agtgtgggac 

acttcgacca 

cctgacaggt 

gtccctctcc 

ttctcttctg 

agtccctctc 

acgagtgtgg 

cttgaggtca 

tacaaaaatt 

ggcaggagaa 

gcactctggc 

gacaaacagt 

gtgggcctag 

ccccaggtat 

gtcacagctc 

aggcacacca 

tgtacagact 

cagcttggct 

acctctttgt 

tttgtccctt 

ccctgacttt 

ctgcagtgta 

cttccttctt 

aaagaaaggc 

cctgggtctg 

ctccccatcc 

tctctgtgct 

tgacacctga 

acgaaaagca 

gggctggagc 

gctttgcttt 

tttctttcca 

gaggctccgg 

ttcctccata 

tttggcagag 

gcaatctccg 

ttacaggtgt 

acagtgttgg 

cccaaagtgc 

gagaggaaag 

tatgatcttc 

ccagtcctta 

ttctgttcac 

aaacagagat 

ccaagagcgg 

ggaagaagct 

tttttcaaat 


43200 

43260 

43320 

43380 

43440 

43500 

43560 

43620 

43680 

43740 

43800 

43860 

43920 

43980 

44040 

44100 

44160 

44220 

44280 

44340 

44400 

44460 

44520 

44580 

44640 

44700 

44760 

44820 

44880 

44940 

45000 

45060 

45120 

45180 

45240 

45300 

45360 

45420 

45480 

45540 

45600 

45660 

45720 

45780 

45840 

45900 

45960 

46020 

46080 

46140 

46200 

46260 

46320 

46380 

46440 

46500 

46560 

46620 

46680 

46740 

46800 

46860 

46920 

46980 
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cccagagaag 

gaattcaaga 

gttcagagct 

ggtagagctc 

tggctcatgc 

gagtttgaga 

agccaggcat 

ttgcttgaac 

ctgggcaaca 

acatactgtt 

cagctatata 

cctcctcagc 

tctaactttt 

aggcatgatg 

ttgagcccag 

aaaataaatt 

gagaggagaa 

gcactgcagc 

cgcggtggct 

gtcaggagat 

aaaaaagaaa 

ttgggaggct 

gatcgcacca 

ggaaaaagaa 

tatcacatct 

atgcagtatt 

catttgggct 

cactacttct 

cctgatcata 

cttttttgtg 

ggtggaaggg 

cacctggttg 

tatgtggtta 

gacaacaaaa 

nnnnnnnnnn 

nnnnnnnnnn 

ggacatttgg 

actagtatct 

ggcctctgta 

agtaggcgct 

ctggtgtgat 

agaggtggag 

gatccaggtg 

tggctggatg 

aggggtgaag 

ctgagaaagt 

gtgtgtgttg 

ggagaagcat 

ggagatgaag 

atgacataac 

atccccaata 

aatggccaca 

atcactactg 

ggggtagt c a 

acaagactgc 

ccacttgctg 

taacctccaa 

atggttggcc 

gtctgttgag 

actttaagtg 

acttctgagt 

ttatttatca 

ataaatgaat 

gaggctcaga 


taagattaaa 

atgactgacc 

ctcccaatag 

aggaatctaa 

ctgtaatccc 

ccagcctggc 

ggtggcagac 

ctgggaggca 

gagggagact 

acaggcacag 

actaccacct 

agaggtaaca 

ctatcaatta 

actcatgcct 

gagttggaga 

agctggtcat 

tctcttgagc 

ctaggtgaca 

caagcctgta 

ggagaccatc 

gaaaaaaaaa 

gaggcaagag 

ttgcactcca 

aaaatatata 

gcgaaattaa 

ccgtcgtgtg 

gttaccaggt 

ggtgagcgta 

aggtaggtgt 

tatttctggt 

aagcaggagt 

gcatggagct 

gctccatgca 

gctgatgtgg 

nnnnnnnnnn 

nnnnnnnnnc 

taatgtctgc 

ttttggtaga 

acaaaaaagt 

aaaacataag 

tagctctggg 

gagggtggta 

aaaaacggtc 

agatggtgaa 

gatgaccaga 

gaccacaaaa 

agttcagtct 

cttgggtgac 

acttgggaat 

ccgtggtggg 

tttaaccatc 

aaggctactg 

cagaggtcaa 

gtgggcacct 

tatggtaaga 

gaggagcttg 

gaaacacaaa 

cctagaaatg 

cccaggctaa 

atcctgacaa 

atctcctcaa 

gctaccttaa 

gaatcaatcg 

aaatacctgt 


gttggctgtt 

atacagaatg 

tcacccctga 

aatccatttt 

agcactttgg 

cacatggtga 

acctgtagtc 

gaggttgcag 

ctgtctcaaa 

accttaagtg 

atatcaagac 

gacccacacc 

gttttgccca 

gtaatctcag 

ctactctgga 

ggtggtgcgt 

ctgggaggtt 

gagtgagatt 

atcccagcac 

ctggctaaca 

ttagccaggc 

aatggtgtga 

gcctgggcga 

tacattgtgt 

tctacactgt 

actacgggac 

tctggctgtt 

tgcactcatt 

gttggctttg 

tagagcggaa 

cggtttctgg 

ggcttctcct 

atgaacccgg 

atttaaaggc 

nnnnnnnnnn 

cagcagtggt 

agacattttt 

ggctagagag 

atccagtcaa 

gagactgtgc 

ttttagaaag 

agactggagg 

agcaggtgac 

gaaagcacta 

gtcctgcctt 

agtgaagcag 

gagatgtgtt 

catatgtgtg 

gatctgcgta 

tgtgcagaat 

tggaagaata 

agaagggaag 

gtaggataag 

gggcaaaagc 

ggaggaagag 

gctttggtgc 

atcatccata 

ccatcccact 

gcgctacctc 

cactgaaaat 

tatattgcct 

acctccctgc 

atgatccaga 

agaatcgaaa 


ctccatcctt 

gggagcaaac 

actgcacccg 

aaaattaaag 

gaggccgagg 

aaccccgtct 

ccagctattc 

taagccaaga 

aaaaaaaaaa 

tacagcccaa 

acattccagg 

tctcctgctc 

ttcttgagct 

cactttggga 

caacatagtg 

gcctgtagtc 

gaggctgaag 

ctgcctcaaa 

tttgggaggc 

cggtgaaacc 

atggtggcgg 

acccgggagg 

cagagtaaga 

actttttggc 

gtgtatgaaa 

aatttgctta 

atgaataaag- 

tcgcttatgt 

taatgtgctg 

catgagggtg 

ctcacacatg 

ttggcgttgc 

gcttctgcaa 

ttcagttcan 

nnnnnnnnnn 

tctcaactga 

tgattgtcac 

gctgctaaac 

aaatgtccac 

ctgagagcaa 

ctcattttgg 

cagggaaagt 

taggaaagtg 

taactaacta 

gcaggtctag 

gtttgtgcgt 

ggactcacaa 

agtctgcagc 

tatatttggt 

taggagagac 

agaggagcct 

cagttcttaa 

aactgaagaa 

agttttggtg 

ggtgttgagg 

aaagcagaga 

tcctggctca 

tctcctctgc 

ctcaagcaag 

gtgtgtctct 

tgttttacta 

aactagagat 

gcctggtaga 

taaatgcatg 


gaaaaatttg 

ttgggaagaa 

gaccatcagt 

tatatcgggg 

tgggaggatc 

ctactaacaa 

ggaaggctga 

ttgtgccact 

aaaaaattaa 

tgaaatttta 

aactcagact 

cggtggtaat 

tcacacagat 

ggccgagacg 

agacccccga 

ttagctattt 

tgagccgtga 

aaagaaaaaa 

caaggcgggc 

ctgtctctac 

gctcttgtag 

cagagcttgc 

ctctgtctca 

atctggttta 

ggttggttct 

tccgtattcc 

ttgctatgga 

aaatatcttg 

acttggttat 

tctcttcagg 

ttgtgactga 

ctactgttgg 

aatacattaa 

nnnnnnnnnn 

nnnnnnnnnn 

gcatagtttt 

agcccagccg 

atctaacaat 

agtgttgaga 

gaaggagtaa 

ctgcttgtag 

aatttgggag 

gcagaggcaa 

atgtgtggat 

ttggaaggtg 

gtgtgtgtgt 

tgtccatggg 

tcagaaacag 

agcttgagcc 

gtgcaccaag 

gccaacagaa 

gaagggggaa 

tgtctgttgg 

gagcaatagg 

aagtggccag 

agccagctca 

aattccagca 

ttatcctatc 

ccttctctgc 

tccattcatg 

atatgctcgt 

tctctttaag 

ggcttgtgtc 

tgtgctctga 


gttttagggt 

agaaggcaca 

tatctctgtg 

c tgggcgcgg 

acgaggtcag 

tacaaaaatt 

gtcagaagaa 

gcactccagc 

agtatgtcat 

cacatctata 

ccatcatacc 

taaccactat 

atacattgtc 

ggagtatcac 

ctctacaaaa 

gagacgctga 

ttgcaccact 

tatggccggg 

ggatcacgag 

taaaaataca 

tcccagttac 

agtgagccga 

aaaaaaaaaa 

ttttgctcaa 

ttttgttgtg 

tatcggtggg 

tattcttgta 

ggtggaatta 

gctgaattcc 

gaatctggag 

actgctggta 

ggcaggtgtg 

caacgacaga 

nnnnnnnnnn 

nnnnnnnnnn 

gcctcagagg 

agaaggtact 

gcacaggaca 

ggtttaggta 

ttggaaagtg 

acagtgcatc 

ccactgaaat 

tggggatggg 

gatgggcagg 

atggtttctc 

gtgtgtgtgt 

acatccaagt 

gcctggggct 

acaagagtag 

aagccaggtg 

attgggaggg 

gtgaagaggt 

gtttggcaat 

gataacagaa 

cgagtctaca 

ctcattgact 

ctaccaggag 

ctatctgtca 

ctgccgtcac 

ttagttctac 

tctgtttgcc 

tatttgttga 

catggtggat 

tctaaactca 


47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
49500 
49560 
49620 
49680 
49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
507 60 
50820 
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gctaaacttt 

tatgtaatgg 

acaaggtcct 

cagtaagaaa 

tgtgtaacaa 

tcggagaagg 

agtggagctg 

gacaagagcc 

actatgtggg 

aaatatggtc 

ggaaggtaga 

actgagtgtt 

ttcctgagga 

ggggccttcc 

aatggggaac 

gccattcacc 

gttggccaag 

gtcacttttc 

ctggcttcct 

tttctcagtg 

cccgttcctg 

agggagatgg 

tttagcagct 

cccagcatgg 

ggctgtgtgc 

tctgttccct 

tgctctcagg 

tgaagccagt 

cttcaggaag 

ggcagttcct 

aatccatcat 

tgcctgtcaa 

gttcttccag 

tcaaagaatc 

taattgcagg 

ctgcagggtg 

ctggagctgt 

aggggcatcc 

tggtgcacac 

cagtaccagg 

cctaaagttg 

ctgtcaggtc 

ctccaagctc 

cagtgatgag 

gggtatcgtt 

ttgaaattca 

agagcctggg 

caaaagtaga 

ggtagcattc 

gtcagggctc 

caggactgaa 

tggagccaaa 

gaccttgtgt 

agtgcctctg 

gaggccttct 

tgcttcttct 

tattaggaac 

tggaatccaa 

acatgtgaat 

acttcataaa 

gcagctatta 

agccaccctc 

gctaagctgt 

tgagggtccc 


ctccaggggg 
aaaaactcct 
tgttgtcatg 
gcaagattat 
aagtaccaaa 
cttatctgag 
ggaccaggat 
tgatcatgaa 
aagactttga 
gcaggggcta 
agatggggca 
catggtctct 
agcgggaaga 
agatctcaca 
tgttgtccag 
cttgcagtgg 
ccc agttaac 
tgagggtggg 
atggtctgag 
tccatcttac 
tgggaccctt 
gccttgaggg 
tcaaacaact 
ctaggttttzc 
tcatctggag 
ggagttgcag 
tcctcgaggc 
gatggagaat 
agagccctgt 
tttccttaaa 
atttacagtc 
aagggccaag 
aaaaaaatca 
tcatggaggg 
gttcatggtc 
tggggaggga 
ggtggttttt 
caagagctcc 
atgtttttct 
tctgaccctg 
ggatgagccc 
ccc tttggaa 
tgtagtaact 
tcaacatttc 
ctcttgctca 
agtcctgagc 
caagtgggcc 
catcagtctc 
atttactaag 
ttaatggcag 
aggaaacgga 
gtcaggctgc 
ctggcaccca 
tgggtactgc 
tggcctccat 
ctttgataga 
ttttgtcttc 
gaaaccacag 
tatctcattc 
tgatgatatg 
agtgctagac 
tcagacactg 
ggatttcaga 
tgaacagtca 


taaagttcaa 

tctatgacct 

aagtttgctt 

atcattttgg 

taggagagtg 

gaggtggcct 

gacaagggtc 

agactttgca 

atgggaagca 

gtgaggtggc 

ggggaggcaa 

gggagcagag 

acccatctga 

tgccaatcgt 

gccgatttca 

cgttagctct 

aagcagttga 

gagagaatcc 

ggagagaatg 

ctctctgctg 

ctctgccaat 

ctgtattagt 

catgtttatt 

tgctcagggt 

tttagggttc 

ggctgaggtc 

tgcccacatt 

ttcccttgga 

ctcttttatg 

gtcaactgtg 

ccagggatta 

caggacttta 

ctcagtaaag 

tttggacctt 

ccttgaagcc 

tgcaggtgga 

ttagtcttta 

ttggtattga 

cctgttgcaa 

gacagcttgt 

cccgagaatt 

gtgctggtat 

ttagggctag 

ttggggaagg 

aaggcactgg 

acaccacagt 

ccttgtgagt 

ttgaaagttt 

caattactaa 

gcagcagaaa 

gtagctcatg 

agaacaggtc 

ggatgttgta 

aactccagga 

ctttttggtc 

gcccagtcat 

aaaaatgaga 

gaaggggtga 

atttctcaca 

aggtacagaa 

ccagtcaatt 

ctgtatactt 

tcctggattg 

actggtttcc 


gttgattagt 

gggcagagtt 

tctgaaggag 

taaatgttct 

gggtgggtgg 

tttaaccagt 

acttgtcatg 

gatggtggta 

tggggacaat 

agcagagata 

ttactgcaaa 

gttcctggag 

gacgtgggga 

cttggtctat 

cagaacaacc 

ttcaccagct 

tcagccccag 

tggagcagaa 

gtgggatctc 

taccttcgtt 

ccctacaccc 

cttctattct 

agctcatcgt 

ctcacaaggc 

tcttccaggc 

ctgttttctt 

gcttgccacg- 

ttgaatcacc 

ggatcacctg' 

gcatgtaaca 

tgcacagtgc 

ttggtgaaga 

ttagaggttc 

caccctagaa 

tattcatagt 

ggtgagggct 

agctgtcatg 

caccatctcc 

cagcccactc 

accaggagct 

agatcagaaa 

ggagaggatt 

aaaggaggat 

aggcagggct 

accccaaggc 

tgtgatgcag 

accgttcaac 

gattaatgct 

ataccagttt 

ctctccttgg 

aaattgcagg 

tagggaggat 

gggctcagac 

actcaatctt 

actagctcca 

atgcgttagc 

ggcctgtcct 

ggtgactggg 

ctacccagtg 

agtttaagga 

tgagtctgac 

ccagtgaatg 

tatgacctaa 

aagaatgggc 


caattgatta 

ataggcagtg 

agagataata 

tgtggaaata 

gcttctttta 

acaaatgctt 

ccagtgagtt 

atgggtttgg 

ggcctgtgat 

gggagaagta 

gacatattcc 

gggaaagagg 

ctgtgctggt 

gtcaattgtt 

gcccagtcca 

tttatctgcc 

agatcaggtc 

catgtaacta 

tggcctgaat 

attttccagc 

actgtaaatt 

gcataacaaa 

gagttcatca 

taaaatcaag 

tcacgtggtt 

gctgactgtc 

tgcgtggtct 

cacatggttg 

attagatcat 

tcacacaacc 

accaggggac 

acagtggaat 

tcttgccttt 

acatcacacc 

ttccaggttg 

gaatagtgtg 

tgttgggggt 

aaggtgatct 

ttgtagaaga 

acagcacact 

agattaaatg 

gactgagtct 

gcctaagatt 

gaggattaaa 

ctccagctct 

ggaaagaatg 

ctcatttatg 

ggtcacactc 

ctgtgctaaa 

ctgatctaag 

aagggccgga 

cccactgctg 

cctggatcaa 

gtcaacgcca 

gattcaaaat 

tgcaaaggaa 

ccaccaagat 

cagctcacag 

aggtaggtat 

acttgcccag 

ttggactgtc 

ttgatgaaat 

aagagagact 

tccctctcat 


attaattcat 

aacaagacag 

aacaagaaac 

aatgtgatga 

gaaagagttc 

tagcttggcc 

tgagcttgta 

gttaattgct 

acatgttatc 

gacggactgg 

ttctaagctc 

ataatgtcac 

tcgtttctaa 

ggggcatcca 

tatctcccga 

ccgtggggat 

cctggagtct 

gaagggccac 

caaacctccc 

agctcctcag 

tcaccgtggg 

ttgcctcaaa 

gcagtgtggg 

atgttgtctg 

gtggcagaat 

agatgagggc 

tttccatcct 

gactctctga 

acccatagag 

acaggagtaa 

aactgaattc 

gtcattcttg 

tgggaagtca 

atgttttcta 

aaaagctctg 

agctgcatat 

tgggcatggg 

ctgctctgcc 

gcagacccct 

cccccacaag 

cagaggtgat 

gtttaggaac 

caggatcctg 

cggagatgat 

tcgctcccat 

tgcttatcag 

tcattggcac 

aaagaccctg 

tgctgcatca 

tagaaaaatc 

aaaccagaca 

ctgagaccta 

tgtatcctgc 

ccgccagaga 

cttgaataga 

gctgaaaatc 

ccataggaaa 

catgcatgct 

tgtcatccct 

gacacgacac 

tgactccaga 

tttcagggtt 

tccctaggag 

caccttatga 
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cagtaatcct 

cgcagagccc 

ttctgtcttt 

aaatgcatcg 

atgaaggaaa 

tcaagctgtt 

ccaggtagga 

gacttccttc 

tttgacctca 

tgcttcactc 

atgggagggg 

ttacgtgagt 

atgtgccggg 

caccatcacg 

aaatgactaa 

atccaataag 

ttggaaaatg 

taatacttgc 

tctcgcccaa 

gttcaagtga 

atgcctgact 

gtctcaacct 

caggcatgag 

atgttaccat 

aagttagaca 

cccatcttcc 

cgctagacct 

ctttaataag 

aatgcaaagt 

cccctgcttc 

catcaatggt 

ttgtaaaact 

tcacagaaac 

taacattctg 

attgttttca 

tataggcgca 

ttttttattt 

accagtaaaa 

gcccaggata 

gtatcgggag 

ggtccttatc 

gttcctccaa 

tttgggaaga 

gtggagctgg 

agggctgg ca 

tcatcctctg 

ggagatggtg 

taccttctaa 

ttatgtcata 

tttacttccg 

gtttagaaag 

gcttttctgg 

gtcccctaat 

ctgtgttctc 

tttggggcag 

ctgttttttc 

tcgtcttcta 

tgaaaggatt 

aattttgtcc 

ctctcttagc 

ggtggatgat 

accaccttga 

ccttcccagg 

ctggggagga 


ctgtccaaca 

agctcaaagc 

ttaaatcaat 

ttcctcgttt 

ttacagtgtc 

ccttctactt 

atcagctcct 

tgaattagca 

atttactcat 

tgattttatc 

agagaagttt 

acgttactta 

cactgtacta 

aaggaagaca 

ctgataagga 

ggcagtgttg 

gggtatcaac 

catgaacagt 

gctggagtgc 

ttctcctgcc 

aatttttgta 

cctgacctca 

ccactgcatc 

aatgaaggct 

tacgtgaaca 

atcacagtca 

attagccaag 

gcagcctcaa 

tataaaaaga 

tggaggcaac 

cacataagga 

attcacatac 

atctcttcag 

tagaatagag 

tttctacaaa 

tttatgccta 

taataggtat 

caggggagat 

agaaccattg 

agaaatcttc 

agccttgcta 

ggcctctata 

tctaaagacc 

tgacagttta 

ggcgtctacc 

ccttgcacat 

gagacatgga 

gtcctatgtg 

gactctgagg 

ggcctgaagg 

aggtccttgc 

cccatcctga 

ggtgtgattg 

cgttgcagga 

tttctgctct 

cctttctatg 

tttctgaatt 

tctcaaggcg 

tttctccatt 

atcgccatga 

ggaaccagta 

ccttgtcagg 

cagagttgag 

acaaggggag 


gccaaagagg 

tcctgactag 

ggagacaggg 

ttctcattcc 

ctgtgcatat 

gaaatgcctg 

tggttcatgg 

gaggatgttt 

cttacaaatt 

tccacctgga 

ggtgccagaa 

accattaatt 

agcaccaagg 

gatgttaatc 

caagcccctg 

cggggagatc 

cacctacctc 

attttgttga 

agtggtgcaa 

tcagactccc 

tttttagtag 

aaagatccac 

cagccacttg 

ggtaggttgg 

tttcctgtac 

gctgggaagc 

cagaagtttc 

aattacagat 

agataaacaa 

aaggtcaact 

tgctcacaga 

taaatacttt 

gtttatagat 

aggacacatt 

tagtcaagga 

aaggatagtc 

ggacagattt 

atgtattttg 

cccagagctg 

actgaatgaa 

tctccctctg 

taagaagtct 

caggaaggtc 

ttctcccagg 

tgtggctttc 

tccttctgcg 

ggagggtgtg 

ttcctaggaa 

gacccattaa 

aaagccagcc 

gtcctggatc 

tttctattca 

tgatgtgtgt 

taaagccaag 

tcatctaag-t 

gtagatgggg 

tctgtgtgct 

tctgttatgt 

atgcttactt 

agtaggtggt 

cccaagccca 

cagcttgggt 

cactctgatg 

atgtgaagga 


tcctgtgggg 

gctcttgttg 

gagggttatc 

ttcccaatgt 

accaaggctg 

tttccttccc 

agccttttct 

cctagcttgg 

aggttgtaag 

tagcatcatt 

agaactggat 

acttcaatgt 

atacaatggt 

cattaaccaa 

gagctacaag 

aggagccaca 

actaggtttt 

ttgatgattg 

tctcagctca 

aagtagctgg 

agacaaggct 

ccacctcagc 

ccatgcatgg 

ccaactgagt 

ttgaagatcc 

ttttaaaaaa 

'tgggoagggc * 

tcagcacgca 

caatctgtct 

atttggtgtg 

taagcaccta 

cctcagtatc 

acaggtccag 

ttactcagtg 

ataacataac 

tcaaaagagt 

gttctcaaaa 

gaaaagcacc 

ttccagatgg 

tgagtgggct 

acagaggcaa 

ttgtgagagg 

tctgggtggg 

aggtccctgg 

aggttatgag 

ggatgtgaaa 

ggtggcttc:t 

agccaataat 

gtacaaacaa 

tcagccaccc 

ctggagcatc 

ggccttcttt 

gcatgtgtgt 

atgaaactcc 

aagtgttttt 

tcagagttac 

taaagggatg 

ctgtggtctt 

tgagcttact 

attgtatacc 

tgactgcccg 

tcccctcata 

cccagggcaa 

aggacactta 


agggcttgca 

agtattcctt 

tccatcctcg 

gtgtactgtt 

tccaacctcc 

ttctaattgc 

gctctgtttt 

tcttaaccct 

ctaattgaat 

tttgacacac 

ttgaattcta 

atatttatta 

gagtaaagag 

gtaatctcac 

agggtgtata 

cagagcctgg 

taaaatcagg 

attgaaacgg 

ctgcaacctc 

gattacaggc 

ttgcaatgtt 

ctcccaaagt 

catttaaaaa 

ggtctgattc 

tcaggacagt 

atgcagacat 

atctgcatat 

tttaccataa 

cctgctttct 

attcctttta 

tgcgggggtt 

ttgccttttt 

ctcttctttt 

tccgattgat 

tctgtaaaag 

gaaactgatc 

tgtttgtggc 

caaggcgatt 

cccctgggtt 

ccccagggaa 

actctctctc 

aagcaaagaa 

tgagtgcttt 

ctgtggctga 

gatgtcagca 

gtgctccttg 

tgaactctga 

cattgcttct 

ataagcgtaa 

ctcagggttt 

aggagctggg 

ttctccacct 

ctgtgtgtgt 

ccttacttct 

tgccttcagt 

acacccaccc 

gggactctat 

ggttctactg 

gagtgccttc 

catttcacag 

actctaagtc 

gagactgggt 

ggtgtgagct 

gctatcctcc 


gatgggagtg 
tgattcctgc 
gctcaagatg 
aactttagtt 
acacctttgc 
atctttccat 
actatgcatg 
tctccttttg 
acaggatcta 
aagcaggcat 
accctgttgt 
agtacctact 
atgcagcctt 
aagaaaagta 
cagggcatcg 
gttgtctcac 
ttaaatgagg 
agtctcactc 
tacttcctgg 
agccacccct 
gaccaggctg 
gctgggatca 
tgttcagtaa 
agaaggaaag 
gactcctaga 
ctgaccttca 
ttttaaaaat 
ccactgaaga 
tccctctcct 
gcattccctc 
ttttttttcc 
tcacttcatg 
catagccata 
ggatatcaat 
ttttattact 
aaatgtgcat 
agttcaaaac 
ctgaagtgta 
cctgaagtgg 
gtgatgaaat 
cctgggggaa 
ggacctgggc 
ctctgctgtg 
cagtttctgg 
ggggcagcct 
gctggggaaa 
ggaggggac:a 
cccgcctttt 
tagtcccttc 
gctgcgttct 
cttggcatga 
cactcccacg 
caatgacaaa 
ggctcttcta 
ctttctttct 
ccttctttga 
ggccaggagt 
tgacattccc 
tctcctttaa 
aaatacagct 
catgctctta 
tccaggttc c 
gtctgtggtt 
ctgccagggt 
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54780 
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55380 
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56160 

56220 

56280 

56340 

56400 

56460 

56520 

56580 

56640 

56700 
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57540 

57600 

57660 

57720 
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58140 
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58260 

58320 

58380 

58440 
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ctgagacttc cacctttgag 
caccagagca ggagatggag 
gctctggaag tgaagatgcc 
atatggtgga caaaaacctt 
tccctgggtg ccagacctgc 
cttgggtgag tggcctatgg 
cccccagcat tgctatcact 
acctgagcct cagagaggtg 
attttggtct ccaaccttca 
ctcaagaccc atgtgaattc 
ctgtagttta cttgccggag 
attaattatc gaatccagtg 
ggcaggatca caggctcggt 
tcagccagca ctcagcttgc 
aaccagggat agatggacac 
agagctaagg atttagaatc 
tatgtaactt aagttagctg 
gctgaggcag gagagtcact 
ccattgcact ccagcctggg 
taaataaaat aaaaaaaaaa 
gagttggcaa cccttgttct 
tgcatagaat ctgatggtga 
cccatcatgg tccagcaggg 
atacggggcc aggtctgctt 
tttctccagg gtcgctgcag 
tactgggctg ctcaccagcc 
ggtgaggtgg ggctggggat 
aggagatgct acaaagagcc 
tctgactcca cagcagtgtg 
ggacaagagg ctgaaagtcc 
ccactgcctc agaagacttc 
cagagctgcc ctctcctggg 
ctccctgcaa taaaatgggt 
cactctgctt agccctcatt 
catagaagtg tcagaagcta 
atggagcctt ttggagatgg 
tactgggcac ctgcatagtg 
cccagtttgc atctctgtaa 
tggttggttt tatatgtaag 
aatgaatgta taagtaccgc 
gggcaccatt tcccagtggc 
ctgccttccc ttcctcaact 
aacacttgta cgcaactccc 
gcttgccatg cgtcagactc 
ttgggatata tggttatttg 
ttgagcaccc acaacatacc 
atcaataatt atgagcccta 
agacttgccc aagactgaac 
agcatgttac tttaaccctt 
gggatgaagc acggggggag 
caggacacca gcccagctgg 
tctcaacccc ttctgtacta 
atttttttct ccagcattcc 
aggggtgaaa caagaactct 
ctaaggtcac atagtaagtt 
ttgttccaca agaatctgtt 
agtgctgtga ggatgcagct 
tctggtgtgg aaaggcattg 
ttcccattta ttaagcattt 
gtcttgttcc tcataaatct 
cagtccacag agatcttctt 
gggctcctag tcccatcctg 
gaagtttccc tgagcagtcc 
tgaaaaattt ttcttggccg 


acccctttgg gtgctaagac 
gagacccctt gcagggagct 
tccaagaaag atggggctgt 
acgtgtcctg aggaagagga 
cgctacctcc tggtgagaag 
ctgaggctga ggtgggagca 
ggctcttttt cccattgagg 
ttggggtaag aggggagggc 
aatggttgtg gcaggggagg 
atgtaggaag gatgctccat 
gtgctacagg ggcaacctgg 
ttctgtcagc gcgctcaacc 
aagagaagtg tgaacactaa 
atcagatttg tctgtttttc 
ccacagacaa cactgagggg 
aggaggtttg ggtccaagtt 
ggcatggtgg tgcatgtctg 
ggaacctggg agacagaggt 
caacaagagc gaaactccgc 
ttaaaacaag accatgagtt 
accttttgtt agggctggaa 
taattgccgt ggatcagcat 
aagggcccat tgcccggtgg 
ctctgccttc cctctgcccc 
acgctttcag tgggttgacg 
ctggtcccgc ggtggtcact 
gaacgatgga aaggtctggg 
tgaccctttg tgggagaggc 
tgggtgggaa aagaggccct 
ctttctgttc tgccttcaca 
ctttcatctg ttcctactga 
cagctgcctc ccctcctctg 
tttactgaaa tggatttatt 
gaaacttctt ccttatcatc 
ctactccttg aggaggagga 
aggaatgggc tcagctagtt 
ctgccaggac ctttcaaggt 
ccaaaggcct tttctctctc 
gaagttaaca tgtccctggg 
aatccccggc ccctcaagtg 
ctgctgtggt agttggcctt 
gcattctctg ctccttccta 
tctctcagga tctccttctg 
tgaatgaggc ctgggaatac 
taacataggc acaaaaacat 
ccctgctgtg gcaggcacct 
ttttacagaa ccaggctcag 
ctctaaatgc aactcatatt 
gtgcttttac agctggctac 
gaaagccaga ctgtctcact 
aggtgaggga gctttaatca 
gggaggtcag cagaagaaaa 
tccaattcat agatcttcat 
atttttttgg agtgtggtat 
gagaaagaga gagaaatcaa 
gatttcaaag atggtggact 
gaggtggaac tttcactcct 
cttcccttat ttcaatatta 
actaggtgtc aggtactgtg 
ctgccgctgt gggtgtgtac 
tgggggagta atatcaagaa 
tatctcga cc taggcatgtt 
actctatagt gtgctttata 
ggcacggtgg ctcatgtctg 


gctgcctgag gatgaggaga 
ggaggaagag gaggagtggg 
tgagtctatc tcagtgccag 
cacagtaaaa gtggtgggca 
tcttcagacg tttagtcaag 
tggaacgggt gtgggatatg 
gccctggggg tgtcagtaga 
cacctacaaa cagaagttgc 
gagggaatga attgtgggga 
tctttgtctt ttatcctgcc 
tttccatcca caacttcaat 
agggtcaagt ctggattgga 
atggggtgca cctgctgatc 
tcctgtataa tctccagaag 
gctgcctggg cattcaggga 
cctttccatc tctcactatc 
taatcctagc tacttgggag 
tgcggtgagc cgagatggag 
ctcaaaaata aataaataaa 
tgtttcctca tctctaggat 
ggacaagcct gtcactggga 
ttcagatgac taggacagtt 
gcagcagaaa gagctggcag 
atcccttctt cccctcttgc 
gcagccgctg gaactttgca 
gcgtggccct gtgtacccga 
agatgggaag tgccccaagg 
ttcctgggtc ttttatatac 
cctgtgggtt gagttgggat 
ggaggccact ggcgtcgagc 
gctggtccca gccagcagtt 
cttgccatcc ctccctccac 
ttctcctctg atcgcggatc 
tctccccaca ccacaacttt 
tggagggtgg agttgggtct 
ctcttcatag aacacctgat 
tgtaggtaga ctcccaatgg 
tctctccaac cccagaactg 
aacagtccac aacattcagg 
gaataaatct aacatgtatt 
attccatgca tttttatggg 
ctctctgcaa ctcccaaata 
gggaaacctg atataagaca 
aagacatagt cctctggcac 
ctactagttg ttatcgctta 
tgcctagatg acctcatgtg 
agaagttagg atctgtcaaa 
gaaattcaac tctgctccaa 
tctcccctta tggtcacacg 
cttgggttca tcttgggaca 
gaggggaggg aggaaggcat 
taattcaatg ttctaaagcc 
atgggattag gggctcagag 
agagaaggga tgctacttct 
actcaggttc atttcaacta 
atgggttcat ccctgtggtg 
tgccctcttg gactttatat 
acaacaaagg gtaataatat 
ctaaatgtta ggtgaacttt 
tttgacagaa gtttgacttc 
ggggca c gaa ggaagctgca 
tacattggtg cattcactgt 
ggagcacatt gtacatccat 
taatcccagc actttgggag 
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61500 
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61800 
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62160 
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gccgagacag 

aaccccgtct 

tcccagctac 

agtgagccga 

aaaagaaaga 

tgggctaata 

ctaagctctg 

cactgtgcag 

caagggccac 

gatnnnnnnn 

nnnnnnnnnn 

agagttatgt 

gttttcttcc 

ttttcccaaa 

gtttcttgta 

attgtgtagg 

gagactggga 

cagaatcatg 

gaagaatcaa 

tcacaagaat 

cccacccaca 

cacagccaaa 

acactattct 

gcagggacct 

gttgccacag 

cagtagtgtt 

cctgtcagga 

gagtcagtgg 

agcattattg 

gtttctgttg 

cttcgctgac 

ggttcctcca 

ctgctgtgcc 

accttcatcc 

acccttgcgt 

aagctggagg 

actgggtttg 

ttcatgcact 

ctcgta'ttca 

tgtcttccct 

tgtctggcat 

ggtcctgtct 

gtctgagcag 

caggggagct 

gggtccctga 

tttggcaagc 

cccatcttca 

aggatgggga 

gtgatggcct 

ttggggcaaa 

ctcacgaaac 

caggcccacc 

caggacctct 

ctctaggagg 

ttacgtgttc 

tactgcacag 

gtttaaatcc 

cacgatcttg 

ctcgatctcc 

ggcatgtgcc 

gctatgttgc 

tccaaaatgc 

atctgaggcg 

gctggtgcct 


gcggatcacc 
ctactaaaaa 
tcaggaggtt 
gatcgtgcca 
aagagatttt 
ttccaaaata 
tgaaactgtt 
agcacagagt 
cctcactcaa 
nnnnnnnnnn 
nnnnnnnnnn 
tcattgattt 
tattattatt 
ggtaaaactg 
ccccttttcc 
tgtatataca 
agaaaaagag 
gcaggaggtg 
aagtggaaac 
agcgtgggaa 
atacgtggga 
ccatatcaat 
gcagagtttt 
ctcatgcctt 
cattcctgta 
acaaagagga 
gggtattgca 
ggttggagag 
atgagtttga 
ggacagtgga 
ttctgttggg 
ctttggcggt 
tgggcttcgc 
tgcaagtgat 
aagtggcctt 
taaagaaatt 
ggggcaggtg 
tgagccttta 
tcctctgagg 
tctttgtgcc 
gccatccatt 
gtttttccag 
agctgaagag 
ggggacaagc: 
agctcactgg 
tctttgggct 
ccctcatcaa 
tccctggcag 
gggtatgtag 
agtatttgtg 
ttcaagcaaa 
ctggcctctc 
gtgctttcac 
tggtcgtgag 
agagg caeca 
ctctatttgc 
atcacttgga 
ctttgtcacc 
tggactcaag 
accatgccaa 
ctgggctggt 
tgtgattaca 
aatgacttga 
ctaccagctt 


tgaggtcggg 

tacaaaaaaa 

gaggctggag 

ttgcactcca 

ttctttttct 

tgggtttgtg 

tatctatggt 

aggggc:tcaa 

ctcatgtgcc 

nnnnnnnnnn 

nnnnnnnnnn 

cattttattt 

ttattttatt 

tgaaacaaga 

tettetttag 

tgtattagtc 

gtttaattgg 

aaaggeaett 

ccctgataaa 

agactggccc 

attctgggag 

acatttccct 

tttttttttt 

ttaaccactg 

ctgatgacta 

tctggctaca 

gtaatccatg 

aagtgggcac 

ggtggggcat 

tggatagtgg 

ttggcctaca 

ggccctctgc 

cctctgtgcc 

cagccgctcc 

ggggcgggct 

gggaggcaag 

tggagggggt 

catctgcggt 

cccacttctg 

cttagtactc 

tacatgcctg 

ggccccgcat 

ggaaaggcca 

ggccgacagc 

aacccctctg 

ggtgatggcc 

aggctccctt 

gaggcctggg 

catggtgcag 

tgtgtgggga 

atcctgagac 

ccctaggccc 

ctctgtgtcc 

tctcctgctc 

getaaggage 

ctcatttttt 

gaaaagaaag 

caggctggag 

cgatcctctc 

gcttatttta 

tttgaattcc 

ggcgtgaacc 

ttgtgacatc 

cccatgtgac 


agtttgagac 

ttagccgggt 

aatcgcttga 

gcctgggcaa 

taaaaagtaa 

tgtgaatttt 

tctttatcat 

tcaacagtgc 

aggeatagtt 

nnnnnnnnnn 

nnnnnnnnnn 

tgtttctgat 

tgtaaaacat 

tatattcaaa 

gtaaccattt 

tgttttcatg 

acttacagtt 

cttacacggt 

cccatcagat 

ccatgattca 

atataattca 

ctcttttaga 

gatgtaactc 

cctgggtctc 

tttggattgt 

tgttcagggt 

actgagttaa ■ 

atttgaatga 

gtggggaaag 

ctcctcccct 

gagagettet 

tcgacggtgc 

tcagtcccca 

ttcctctatg 

ctgtggagac 

gcggggcctg 

gagaccagag 

gccctctccc 

tttcagttct 

tagtcttact 

tcttttcttt 

gtgeettett 

gacaggtgtg 

ccccagaggc 

gtttctctcc 

ttgtcggctg 

cagaatgacc 

ccttaggcct 

cttcccaaag 

aacagacatt 

ctcaaaggtg 

acacagggtt 

ttacacctgg 

tttgcagaaa 

aaaagtcaac 

atttttaaag 

actaaatgtt 

tgcagtggca 

atctcagcct 

ttttattttt 

tggcctcaag 

actgtgcatg 

aggtg ac cta 

cttgaacatg 


ctgcctgacc 

gtggtggcac 

acctgggagg 

caagagcgaa 

aaatcatgaa 

cctctccagt 

tgaatccttg 

actcattgct 

ctgagagctt 

nnnnnnnnnn 

nnnggtcact 

ttttaaagat 

ttacatatca 

gaagtttact 

ttattttttt 

ctgctgataa 

ccacatggct 

ggtggcaaga 

ctcgtgagat 

gttaccctcc 

aegtgagatt 

taaaaggtag 

tatcctgagg 

cattacatgg 

ttccagtctt 

ggggaggggc 

tggtagttta 

tatgtaggag 

gattegagga 

ttttccaatc 

ttttcctctc 

cttcgctggc 

tcctccctct 

ggagcaacgc 

ggacacactg 

gaggeagtea 

gtggtgggaa 

tctgttttct 

ccagggaaga 

teetttgeta 

cctggtgcag 

ctgggttctg 

gttggagggc 

caggcttctg 

ccagtttccc 

tggtgtctct 

cattttacgt 

tggctgcccc 

ggtctgtgtt 

ctggagtagg 

tttcctgctt 

tccacagttg 

agggatgctc 

ctgaggctca 

tttgaattet 

cagcaaatct 

ttttgttttt 

caatctcggc 

cctgagtagc 

ttgatagaca 

cgatccaccc 

gccaaaagag 

gtaatcagct 

tcattgaatg 


aacatggaga 

atgcctgtaa 

cgaaggttgc 

actccgtctc 

ataaggggac 

aagatactaa 

gagttcctta 

ttttcataga 

tgettaaget 

nnnnnnnnnn 

aatttggtat 

tgttttactt 

gacatttaca 

ttccctctct 

aaatataaac 

agacctatct 

ggcaaggcct 

gaaaaatgag 

ttattcacta 

cccactgggt 

tgggtgggga 

tatactgtat 

gtgctctgta 

ctgcagcata 

ttgctattac 

agatgtgtag 

aagetaggat 

gtgaatgatc 

tgactcccag 

ttccttggcc 

tgttcgccca 

cctgacatcc 

ccagtacctc 

ggccttcctc 

gggcaaagag 

ggtgcgggag 

ggatagaaca 

acctggtgaa 

aatggaaaag 

gtgcgtgcat 

cctgcatgag 

tgggtcaaat 

aggcctagga 

cttggaggga 

ttcagagcac 

gctccagttc 

gagtactggg 

aaatctggct 

attcaagtat 

gtggggaatt 

gtggtgagtg 

gccccaggga 

tgaggtcctg 

aagaggttac 

gtgttttgac 

tagaatagga 

gttttggaga 

tcactgcagc 

tgacactaca 

ctggggtttc 

gtctctgcct 

taaacttgaa 

gtgtattcta 

ctcgctaggc 


62400 

62460 

62520 

62580 

62640 

62700 

62760 

62820 

62880 

62940 

63000 

63060 

63120 

63180 

63240 

63300 

63360 

63420 

63480 

63540 

63600 

63660 

63720 

63780 

63840 

63900 

63960 

64020 

64080 

64140 

64200 

64260 

64320 

64380 

64440 

64500 

64560 

64620 

64680 

64740 

64800 

64860 

64920 

64980 

65040 

65100 

65160 

65220 

65280 

65340 

65400 

65460 

65520 

65580 

65640 

65700 

65760 

65820 

65880 

65940 

66000 

66060 

66120 

66180 
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ctctgtttct 

agaatgaaaa 

gaatataaac 

ttcattgagc 

ctgcaggtga 

gtatatcggg 

cctcactttt 

gtcccaaaag 

taaagacaca 

tatcacacag 

ccttgcactt 

tgtcaactat 

cacgggttaa 

tctccctcaa 

acagggttcc 

cctggtctgg 

aaagttcttc 

ccccaagggc 

ttggggatgt 

ctctcgggaa 

atctctctag 

gtatgtttct 

agtgtggcag 

cctgggatgt 

cacatcctta 

gctggatgga 

agccccttaa 

atctctgatg 

cacatgaggg 

gagaagttga 

cacctgctgt 

gggggatggg 

aatgtttatt 

tgcagaaggg 

aggggccagg 

cgtttaaatc 

cccactgtcc 

tgatctcaac 

ctgtggacca 

gctcaccctg 

atcttctcca 

acacccttcc 

gcagcatact 

atccacatgg 

ttttggcctc 

gtgtggtgca 

ctctgtgact 

tctgggctgt 

gatcacagct 

agttggtgaa 

agtactgaca 

ccagacacaa 

tggctcatgc 

aggagttcgt 

agtgtagtgg 

tgagcctggc 

tgacacagtg 

ttccagaata 

gtggaggcca 

gggtggactt 

atgcctggat 

gggaagatgg 

tctgccctga 

cactctgtta 


ttatctgtga 

gtaactgaaa 

tcggtagact 

aacagttctg 

atgtgatgtt 

aatgccgtac 

cagccccgag 

actttgccta 

caaggacgtc 

actcaaagca 

agccttggca 

ttctttaggc 

attttcaggt 

gagtgcagcc 

ctggg cc cag 

tctcagaatc 

ctggagcaga 

catgggagcc 

tttcaagaag 

taattttata 

gctttgggcc 

gaggaggccc 

ctggacccag 

gggcccagag 

actaacttta 

atcagaaatt 

ccactctgag 

gcgattgtga 

tttgctcctg 

gggtgcagga 

ctcattccca 

ggtggaaggc 

gagcacctgc 

tctgaaaccc 

agcctgggtc 

taatggatgg 

acaggacaaa 

ctctcccctt 

agattcccag 

cctcagcttg 

aaaatcatta 

tggtgactgg 

ccaggaggca 

taccagctaa 

agaaccaatt 

gtggaaaaaa 

ttggatggac 

tgtaaggatt 

agcattacag 

cggacaaagg 

cctgctacaa 

gaaactgcta 

ctgtaatccc 

gaccagcctg 

catgcacctg 

aggtggaggt 

aggttccgtc 

ggccaatctg 

tgggtagtga 

atggtatgtt 

tccccaggaa 

gcgagaggag 

gagccttcct 

ccctggctgg 


aatgggcttg 

gtccttcctc 

tctggtccct 

ttgttcagaa 

catgcttgcc 

ttggaaagaa 

gatggttttg 

tcccagcaaa 

tgcgcagcaa 

aaggcatgtg 

gacccttgac 

ctgtttggct 

gaagagtgag 

ttggctagag 

tctcagccca 

acttttccca 

tgtcctagaa 

agctgacctg 

ggatataagt 

gaaaaataaa 

ggggctgtgt 

taccctggca 

aggtggggca 

acttcctccc 

gcagaaccct 

cctagtttga 

tcactagttc 

gaatgtaaag 

tgtttactca 

ggagagatgg 

ccttgctagg 

agggggtggc 

cacgtgtcag 

agctcttcct 

tgagggcgga 

atttggtctt 

aatccagatg 

ccctctttac 

tctctgggtc 

tgaaatccat 

ctcaggcttt 

catttgcctc 

aggtctgttc 

ttcttgttga 

agaaggagcc 

acccttctgg 

cacttctttg 

acaaattagg 

atgatatatt 

aaatgtacca 

cacggatgaa 

atgattccat 

agcactttgg 

gccaacatgg 

taatcccagc 

tgcagtgagc 

tcaaaaaaaa 

tagaggcaga 

tggctaatgg 

aattatacct 

gtgtacagta 

agtaagtggg 

cctgcaactt 

agtgcagtgg 


atattcctcc 

cagggcacca 

tcattggtca 

ttcctggatt 

attcttctga 

agtccctctg 

ttcatcttcc 

acacacacac 

gaaaagaatc 

gaacttcttt 

tccaggggag 

ccgaagccta 

gttgtcatgg 

aactcacagc 

gccactgatg 

tctgtaaaat 

ggttttagga 

cttgaccgaa 

tatttacttt 

gcttctgtgt 

gtgtgtgtgt 

tgagagggta 

ggaaccctga 

ccaggaaccc 

actcctcact 

gtttcaattc 

cccacotgca- 

tcattgcaac 

tgacccttgg 

caggtgggtg 

agagggacta 

aggcaggttc 

gccctgtcct 

tcttcctgtg 

caccaaagtt 

gttaccctgc 

ctagcctggc 

ctcctaccaa 

tgtgtgtgca 

ctggtctgct 

ccagcatgtc 

cacatcatga 

tcgcctggct 

atgaatgatc 

agaaaaacac 

aaatctcagc 

tcagtatggt 

tcgagtgctt 

aaagccaaaa 

tatttgggat 

tcttgaaaac 

ttaaatatga 

gatgccaagg 

cgaaaccccg 

tacttggttg 

caagatcgtg 

aaaaaaaaaa 

aagtagattc 

ggaaggggtt 

caataaaact 

gacttctgtg 

agaagtgacc 

tatttattta 

tgtgatctca 


tctaccccaa 

tagtgtctgg 

tggaatggac 

tcacctcact 

cattcttcca 

caattgcata 

accacctttg 

acacacacac 

tcagttgcca 

atttcaaaac 

atgacctggg 

tatgtgcctg 

cctcagctat 

tctgggaaaa 

ctggatgacc 

tgagatgaat 

atagtgacag 

ggatttctga 

gggcatttaa 

ctaaggcaac 

gtgtgtgttt 

gggaatctgg 

ctatgattca 

ctctgcttcc 

acacaccccc 

tgcccctcag 

aagtgcagtt 

tgcctagcac 

ggaggacggg 

ggatgggaga 

ggaaagaaca 

atccatccat 

gggtgctggg 

gatgtcgggg 

ctagtggtgt 

tcaaaagctt 

attcaaggct 

cagcggggca 

ccagttcctc 

gggatcctgc 

tgagtccctg 

ccctcccacc 

ctaattaatc 

gttgaatgag 

atgggggtgg 

tctgtcactt 

gggagaaata 

ggcatgtggt 

aaagatgcct 

attatttggc 

attagactaa 

aatatcgggc 

tgggcagatc 

tctctactaa 

gctgaggcac 

ccactgcact 

ggaaaaagaa 

atgattgggt 

tcttttgggg 

gttatttaaa 

agaatcagaa 

acgtgcgcaa 

tttattttga 

gctcactgca 


ccgatagtgc 
gtgaaaagta 
cagtgcttgc 
tctgctctcc 
cccctttctg 
gttcagaagc 
aggacctcgt 
acacacaaaa 
agcagattga 
agaagtgtct 
ggaggaagtg 
gatcctctgc 
gcttcctggc 
agaggagcag 
ttggcctgac 
tttggtgttg 
agtcaggcca 
cagactatct 
aagaaaattt 
tactgtttcc 
gtgtgtatgt 
ctacacatct 
ccccgctggt 
tcttcctct c 
agctagaagc 
cagctgggca 
aatcatttct 
atggtaggag 
ggcaaagagg 
atctggggca 
gtgggaggca 
tcattcaaca 
gctataaaga 
tgtaatttcc 
ctattagcag 
tcagcagctc 
gtcactagtg 
gagcccaccc 
tgcgtgggtg 
tcaaaatgtc 
gcacttggtc 
ccttgcctgg 
tgtgcttacc 
tggattcttg 
gggaggtgca 
actttgtcag 
gacatgcctc 
gggttgaaca 
aatgtccacc 
aatcaaaaaa 
gtgaaagaag 
cagggtgcag 
acttgaggcc 
aaattagccg 
aagaattggt 
ccagcctgga 
aaaaagaaat 
aggcctgggt 
tgatgaaaat 
ggaagaaaag 
atgatttctg 
ctctcatcgt 
aacaggttct 
gcctcgacct 


66240 

66300 

66360 

66420 

66480 

66540 

66600 

66660 

66720 

66780 

66840 

66900 

66960 

67020 

67080 

67140 

67200 

67260 

67320 

67380 

67440 

67500 

67560 

67620 

67680 

67740 

67800 

67860 

67920 

67980 

68040 

68100 

68160 

68220 

68280 

68340 

68400 

68460 

68520 

68580 

68640 

68700 

68760 

68820 

68880 

68940 

69000 

69060 

69120 

69180 

69240 

69300 

69360 

69420 

69480 

69540 

69600 

69660 

69720 

69780 

69840 

69900 

69960 

70020 
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gccaggctca 

caccacatct 

cactctgttg 

tcccaaagtg 

cctcaagtgg 

tcagcaacgg 

tggaggtgaa 

ggagaacaca 

ccttttaagc 

aaagtcaggt 

tcccaaggtc 

tcttcttctt 

tcttcttctt 

tcttcttcct 

cttcttcttc 

tcttgctcta 

ctggccttaa 

caccatgccc 

aaataaagag 

tggttaacat 

tgatctcata 

ctgctcacaa 

tccctttccc 

ttggtcagca 

gtgggcttat 

tcactatcca 

aggtgctgct 

gcgagaggga 

gagccttcct 

ctcactctgt 

ctcccaggct 

tgtctaggag 

ggtgtgacat 

catccaggct 

aatgtctgag 

gattttgagg 

tctctgtcgc 

cggttcaagc 

ccacacccgg 

tggtcttgat 

caggcatgag 

tgctcacagg 

gctggctaga 

tgtgacctta 

cagtttgttg 

ctggagatgg 

gattcctggt 

gagttgtaag 

agtgagtgat 

gttattagta 

ctggaatgcc 

tgtctccacc 

gcaaaaaaaa 

atcaatcttc 

tccaagcaca 

gggctgggga 

tgtccttgta 

tcagagagta 

taggtccttc 

tcaagtacgc 

cgctctgtca 

tcaagcaatt 

gccagctaat 

ggtctcaaac 


agcaatcctc 

ggctaatctt 

ctcaggctgg 

ttgggattat 

aaccaatgag 

gcttccaatg 

ttggccccca 

tgccgtgtaa 

ctattcagcc 

tcacccatta 

acattcattt 

cttcttcttc 

cttcttcttc 

cttcttttct 

ttcttcctct 

ttgcccaggt 

gcaatccccc 

agcctgcatt 

ggacatgaaa 

gttggtatgt 

agcttttcct 

ccccatctcg 

cttccctgcc 

gattggagag 

cacaaaagag 

taatctggtg 

aagaacactt 

gtaagcagga 

cctgccacgt 

cacccaggct 

caagcagtcc 

ccctttttcc 

gggcacagca 

tccaggcata 

tgtatcttaa 

tgttttgttt 

caggctggag 

gattctcctg 

ctaatttttg 

ctcttgacct 

ccaccgtggc 

cccctcagca 

gcagtagggt 

accaccctga 

caaggactag 

actgtccaag 

acctccgctg 

gattgagtgg 

caatataaac 

acatctgtgt 

actattatag 

gaacaaaaca 

aaaaagactt 

atattcaatt 

cagaaatgtc 

agagtgcagg 

ggagtcatga 

ataaggataa 

attaatggca 

tacatgacag 

cccaggctgg 

ctcctgtctc 

tttttttggt 

tcctgacctt 


ctgtttgagc 

ttatttattt 

tgtcaaacgc 

gggtgtgagc 

gaagcaagca 

caacccttct 

tcacaccccc 

gggacagcca 

cagggaagtg 

gcacattgtg 

agacattacc 

ttcttcttct 

ttcttcttct 

tcttcctctt 

tcttctttct 

tgaatgcagc 

tgcctcggcc 

tattctcttg 

aagacgcatc 

tccctcccgt 

tatgtaatca 

tggttctgcc 

tgtgtggggg 

tgcaaataat 

caccttggga 

ctaactgtac 

ctattctgtg 

gaaacaaccc 

ggttttgttt 

ggagtgtagt 

tccccaaatt 

tgccagcccc 

gtgagtcatt 

agatccagtc 

acccaggacc 

gtttgtttgt 

tgcagtggca 

tctcagcctc 

tatttttaat 

cgtgatcctc 

ccgcccaatt 

tattctttcc 

gtgtgggaaa 

acctcagttt 

agagtaacct 

accaacttcc 

cttcatctgt 

ctggatgaac 

cacctggctt 

gcccttcaga 

ttcaggatag 

acacaagttc 

tcgagttacc 

tttgcgatac 

aattaccagg 

cagacagatc 

agcatcttag 

gtaaggctct 

gctactaatt 

tgaaacagtt 

agtgcagtgg 

agcctcccga 

atttttagta 

gtgatctgcc 


tcctgagtag 

atttatttat 

ctggactcaa 

cactgtacct 

actcagagct 

ctcctgtaac 

acagtgccaa 

acagcataaa 

catatgatca 

gggctggtat 

tttactttac 

tcttcttctt 

tcctcttctt 

cttcctcttc 

tcttcttctt 

atcatcatac 

tccaaaagtg 

taagaaagat 

accattaatt 

catttgactg 

aatagtagcc 

acgccttgtt 

tcctagatga 

aaaaacactc 

tattccagag 

tttagctgaa 

agaatcagag 

acaggcacag 

gtttgtttgt 

ggcaagatca 

caaagcttgg 

tcaggggatt 

cctctgacat 

ctctggtgat 

atggcttgtg 

ttgtttgttt 

cgatctcagc 

ctgagtagct 

agagactgtg 

ccgcctcggc 

ttgagttttt 

tgggtccagt 

gctctgggct 

cctcacctgt 

tgggaataaa 

agttcctccc 

aaaatggagt 

gtcaagcact 

gtagcaggtg 

gcgtgcacca 

aaaacctccc 

tttattactt 

ccaaatctta 

ttttgtctcc 

ggcagggcta 

tgggtcctgt 

tgcctttgtt 

acgtgaaaag 

tttattacat 

tttttgtttg 

cacgatcttg 

gtagctggga 

gagacggggt 

caactcagac 


ctgggactac 

ttatagagat 

gtgatcctcc 

ggcacctcct 

ttcacaagtt 

cagcctcagt 

gctgggccct 

ataggaattg 

gccccatttc 

ttaaaccagg 

atttcttctt 

cttcttcttc 

cctcttcttc 

ttcctcttct 

cttttttttt 

ctaaatgcag 

ccaagatttc 

atcatttaaa 

ggaccactca 

gatgtatgtg 

aaaaacatga 

tatccccatc 

cggtgagcca 

agaaggcgag 

aatgacctca 

ggtgctggca 

atgatttcta 

ctctcatctt 

ttgtttgttt 

tggctcactg 

agtgatggtc 

gatgactctc 

tctttgggaa 

aaggagttca 

ttcacaccag 

gttttttgag 

tcactgcaac 

gggactacag 

tttcaccatg 

ctcccaaaat 

atgttctaat 

gtcacctccc 

ttgcaggcac 

aatggaaata 

aggtagcagc 

cacacaagct 

aacaatagga 

tcaaagggga 

tgctgtgtgt 

cacttcacac 

tgcaagcact 

ggaatgggaa 

agccaaagtc 

ccagcagtca 

tgaattcctt 

tatcacgttc 

tgctacctat 

tgctcggccc 

acgcaaaatc 

tttgttttga 

gctcaccgca 

ttacaggcat 

ttcaccatat 

tcccaaagtg 


aggcgcatgc 

tggggagtct 

caccttggcc 

gcaacttctt 

ttgatttcaa 

agagaggaac 

tccatcaggg 

tgtgatgatc 

atagatgaag 

tctgtctggc 

cttttcttct 

ttcttcttct 

ctcttcttcc 

tcttcttctt 

tgaggtgggg 

ccttgaactc 

aggcatgagc 

acagacgaga 

gagataatca 

ataatttaaa 

ttttaaatgg 

caccccctac 

gagggcagcc 

ctgttgtcaa 

tacccgctaa 

ggtcctgccc 

gggaaaatgg 

tctgccctga 

cagatagggt 

aagcctcgac 

ccagtggtta 

aaatgcttca 

gaacattttc 

cagacaggac 

accctccagg 

acagagtctc 

cttcgcctcc 

gtgtgcacca 

ttggacagga 

actgggatta 

cccaaacatc 

aggcctgcag 

tgatcagctg 

ggtaccacgg 

agcttgggct 

ctggcactta 

atactttata 

cctggcatgt 

ggctgcaggt 

cttgtggagt 

cgctttagct 

aacttcaaag 

aatgaaaaat 

atggagagaa 

tcagagccct 

ttagattggg 

aatgcctacc 

tggcacatag 

acattacagg 

gacagagtct 

acttctgcct 

gtgccaccac 

tggccagact 

ctgggattac 


70080 

70140 

70200 

70260 

70320 

70380 

70440 

70500 

70560 

70620 

70680 

70740 

70800 

70860 

70920 

70980 

71040 

71100 

71160 

71220 

71280 

71340 

71400 

71460 

71520 

71580 

71640 

71700 

71760 
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ttggactgtc tgctgcatga 
atcccagcac ccagaacagc 
teagteaget ggtttcacac 
acagatttcc ttcatttctt 
ttggagtaca gtggcacgat 
tctcatgcct cagcctccca 
gatatttttt tcttttcttt 
tggagtgcag tggcgtgatc 
ctcctgcctc agcctcccaa 
tttttgtatt ttttagtaga 
ctgacctcgt gatccgccca 
ctgcactcgg ccattttttg 
gctggtcttg aactcttggc 
attacaggca tgaaccactg 
acaactcagc tcaagtttca 
ttgtccacat acccaccaaa 
aataatcagc ttagttttct 
aggaaccacc acgtggtagg 
aataagcttg cggccaacat 
gaaaaggtag gctaaaatca 
gcagggtagg taacaggaaa 
tgaagggtag cgttatcgtc 
acagagacct taaagcagtt 
aggaatgagg ccaggcatgg 
caggcggatc acctgaggtc 
tctctactaa aaatacaaaa 
teaggagget gaggeaggag 
gatcgtgcca ttgeattgea 
taacacagga acagaaaacc 
tgaaaacaca tggacacaga 
tggggggtaa ggggagggaa 
acacgtatac atatgtgaca 
aagaaataaa gaaaaaacca 
ataaaaaata ggccaggcat 
egegggeaga tetettgage 
tgtctctaca aaaaatataa 
gtcatcccag ctactcaggt 
tageagtgag ctgagatcgt 
ctctaaataa ataaataaaa 
ttccatatct gaatcaatag 
tcctaaactc tgttctcctg 
gctgtcatgt gggctcctgg 
acagggtggg tctaaatcct 
ctccatctca tgagcctcac 
tcaattcatt atttaatgaa 
gggcacagtg gctcacgcct 
ctgagtcagg agtttgagac 
tacaaaaatt agctgggtgt 
ggcaggagaa cgcttgaacc 
cactctagcc tgagcaacag 
tgaagaatgt acataaaatg 
agttaaacat agagttacca 
agagaattga gaatatttgt 
ttcattacag ctaaaaagtg 
aaaatgtggt atacccatac 
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aatgtcatat 

cttgataatg 

gcatttatat 

tgccagagac 

tttcttcttg 

tgtgaatata 

gtctcgctct 

ctgcctccca 

atgtgctacc 

gcccaggctg 

gctaggatta 

attgtatggt 

cctttgcgtg 

gagaacacag 

gaagcgccat 

atctagactg 

tgtcagaatc 

acaacaggag 

ataggatgca 

ggatcttctg 

cctccctaac 

ccagtggctt 

catccagcct 

ctaccatccc 

accgggtctt 

atcgagatgt 

aatgcacaaa 

ctctgtgtct 

cctcaagttc 

atcttttgtt 

ggctggagtg 

agcaattctt 

cggctaattt 

gaactcctgg 

tgagccactg 

cctcttgtag 

catctgagtt 

caagggcctt 

acccaccatg 

accttctggg 

cctgctgaac 

ggctgacctc 

gattacaggc 

nnnnnnnnnn 

nnnnnnnnnn 

aaaaaaaaaa 

tagtccaagc 

tgcagtgagc 

cctaaaaata 

aaggatgaga 

ttagcaattg 

aatactatat 

ggaagagaaa 

tattaagtat 

aacccaaaag 

ctaccatttc 

gggcaagaaa 

aactctctag 

tcaatggcca 

acatgtcaaa 

aaggtcacta 

ttatttctgg 

aaaagtagaa 

agaactttgg 


tatttggcca 

ttattctaag 

gaagtgccca 

tggagggagg 

gggtaatgaa 

caacatgctg 

gtcacccagg 

ggctcaagca 

atgcctggac 

attttgaact 

caggtgtaag 

gtgcgaatta 

tgtcaggcac 

agcccagcag 

ggaagaaaga 

gggctcaggg 

agtagaattt 

gatgcggaga 

gagcctgtgg 

agacaataaa 

ttgcttgctt 

aagacacgaa 

atcagcgagc 

ccaagcccag 

cccaatccta 

agatttgatc 

ctcccaatga 

ttgccatggt 

ttcctgggct 

tgtttgtttg 

caatggtgca 

ctgccttagc 

ttgtattttc 

cctcaaatga 

tgcccaggca 

taactagcct 

tccctttgat 

tgcccacctt 

cctcctgcct 

aggcagtctt 

atctcttttt 

aaactcctgg 

atgagccact 

nnnnnnnnnn 

nnnnnnnnnn 

aaaaaaaaaa 

tacttgggag 

cgtgatcgta 

aagaagaaaa 

aaggaataag 

tattgttatc 

atgtgtatgt 

gggctacaaa 

ttatgatatt 

cagtgacaac 

caattaaaga 

cacacctagt 

aaataatgtg 

aaggtgagac 

tatattttaa 

gggcaccaac 

gtaacaaaat 

gatagaaata 

gaggccaagg 


taaaaaggaa 

tgaaagaaac 

gaataggcaa 

agataatggg 

aatgttgtac 

aattttatac 

ctggagtgca 

atcctcctgc 

aatttttgca 

cctggactca 

ccaccactcc 

tatctcaatt 

tagggattcg 

gagagagtca 

catggtgcca 

aggaatcttt 

atcaatcctc 

cattccgaga 

tgtgaggttg 

gagtatgcaa 

ccactgttgc 

actcaagtca 

ttgtgggtct 

atcaccatca 

cccccgcaac 

aggccacatc 

tggcccctga 

aacctccttc 

ggaacattct 

tttgtttgtt 

atctatctcg 

gtcctgagta 

agtggaggtg 

ccaccctcct 

acacttcaga 

gactccagca 

aaaactcatc 

cctcaccact 

gacaaattcc 

ccctgagcac 

tgaggggcgg 

cctcaagcga 

gtacctgacc 

nnnnnnnnnn 

nnncctgggc 

aaattatttt 

gctgaaggga 

ccactgtact 

tatggcaatt 

gaaggacaga 

aagtaacatc 

acatttacat 

tgtaaaatca 

tttcttataa 

ccaggagcaa 

aacccagggc 

gaaatggacc 

atttcatcta 

aatttgagca 

aatgaaatat 

tcatatttca 

agtaaggaat 

gaaaatcatc 

caggtggatc 


gtactgatac 

cagacacaaa 

atccacagag 

aaatgtggaa 

aattagataa 

tttattatat 

gtggcacgat 

ctcagcctcc 

tttttagtag 

agtgctccgc 

cggcctaaat 

taaaaaaaac 

atgctgaata 

cagatgaatc 

tgagaacata 

cagggtgatg 

cagaggagga 

gctgatcaag 

cagctggaaa 

taatctcaaa 

ccgtatcata 

tcccaggctt 

taccacgtaa 

gctcaggcct 

atgaccccaa 

ccttgaaagg 

gtcctgtgcc- 

acacccatta 

ctgcactagc 

tgtttttgag 

gctcactgca 

gctgagacta 

ggttttcacc 

cggcctccta 

tcttaatgat 

atgaatcctt 

atatatttgt 

atcctctcac 

taaccatcct 

caagacaatg 

gtagagatga 

tcctcctgcc 

gcaactgggt 

nnnnnnnnnn 

aacaaagacc 

aaattagcca 

gaggatcact 

ccagcttggg 

tgactgtaca 

aaaaaggaaa 

aatattggta 

atgcatatgt 

aggaaataaa 

aaaaattata 

taaaaaacct 

tagttgggaa 

tgaacattta 

aaagacacag 

tcaagaagaa 

tataaaagaa 

aactggtaaa 

gtttttcttt 

cttttggcca 

actggagatc 


atgctacaac 

agaccacata 

acagaaagta 

tgactgctaa 

tggtgatcat 

tttatttttt 

ctcagctcac 

tgagtacctg 

agacagggtt 

ccacctcagc 

tgtattcttt 

aaaacaaaac 

agacacagac 

aagtgttaca 

cgcttagaga 

cttgtgctca 

aacagcaaat 

ggctggtgtg 

ggtaaaacac 

cgaccgaaac 

aaagcaccac 

tctccccacc 

agacttctca 

ggactcctgc 

tagcccatca 

cttcctgtga 

atctgggtct 

atactccatg 

ctagccaact 

acagtcttgc 

acctctgcct 

taggcaccta 

atgttggcca 

aagtgctggg 

catttccttt 

ttgcaatgta 

tcctctgaca 

cacttaacac 

tcaaatctca 

ggacacattc 

gtgtctcact 

ttggcctccc 

tagnnnnnnn 

nnnnnnnnnn 

cttcctctac 

gacatggtag 

tgagcccagg 

caacagagtg 

tctctaatgg 

caaagaagta 

aaaccagtaa 

taggaaccaa 

aatttgaata 

tatatgttaa 

cagcatccag 

aaatgacaat 

attgtgttag 

attctgggct 

tcatgacaga 

acaattagta 

taaatgtgta 

ctagaagaat 

ggggcagtgg 

aggagtttga 


atggatgaac 

ttgtatgact 

gattagtggt 

tggtatgggg 

tgtaaaactt 

ttgagacaag 

tgcaatctct 

acactacagc 

tcgctatgtt 

ctcccaaagt 

aaaagactga 

aaaaaaaaaa 

cctaccctca 

tcatctatag 

agggaatttG 

gagttttcca 

gaaaaatctt 

aacaaagcac 

taattacatt 

tgaccttcct 

cctcttctac 

tcactctcca 

tctccagcta 

aacctttcta 

gaatggacta 

ccctcgggga 

gccctctgcc 

ctctctccta 

aaccctttag 

tctgttgcca 

gccgggttca 

ccatcacgcc 

ggctggtctc 

attacaagca 

aagtgcctga 

acctatataa 

gttcagaggg 

agaactcacc 

ctcacctatt 

ctttatacac 

atgctgccca 

aaaatgctgg 

nnnnnnnnnn 

nnnnnnnnnn 

aaaaaaaaaa 

tgcatgcctg 

aggttgaggc 

agacctcatc 

gatatatcct 

gcaacagtat 

ttatatttaa 

gtttatcaga 

aaaatatcag 

ctctatccaa 

actgtagtct 

ttcatgtcta 

aaagtaagga 

ggtaaagttt 

acagattaaa 

gccatccctg 

agccaagcat 

tctagtgatt 

ctgtaatccc 

gaccagcctg 
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gccaacatgg 

ctgtaatgcc 

ggtcgcagta 

gtctcaaaaa 

tgggcctagg 

actttatagt 

taacaagaca 

ccactgagtt 

ttctaatcct 

aaaacaacgc 

aaaaggtagg 

tacactgcat 

ttgaaatgat 

gttatttttt 

gttagagata 

gatttgcttt 

tagatgaaat 

aaatacctca 

gatgtggtaa 

ccccagtgct 

tctctcattc 

ttcatctcct 

aagccttccc 

ccccccatgg 

ctccaccctc 

gaatccctgg 

aatgaatgaa 

cttcacgcct 

gagttcaaga 

gggggatggt 

cttgaaccca 

ggcaacagag 

agatgtcatg 

cccgcatctc 

gcaggccatt 

aggtgctgac 

ccaggcaggg 

accctcctct 

gatgggctac 

tcagcaagga 

acaccaggtg 

gcttctgccc 

ggaaagtgga 

cccaccttct 

cacacatccc 

gttcttctct 

taataccttc 

gtcagaaggg 

cctttctttt 

catggtccag 

gatggttcct 

aaataggtcc 

accctattct 

atggtaaaaa 

aggtcatagg 

catcatctca 

aagtcccaaa 

aggcctgttc 

tctcccaccc 

agacgcagaa 

cggacccagg 

agaagagggc 

aaggggtggg 

gccctggacc 


tgaaacccca 

agctactcgg 

ggctgagatt 

aaaaaaaaga 

cattgatcat 

ggatggatca 

acagacacca 

attcacggca 

gttgtttaca 

caaatccaaa 

ggggaggact 

ggaccaggca 

tggaggaatt 

aggtgtgatc 

catgtacggg 

gaaaactcta 

aagtatgcaa 

gcatacctct 

acagcctgac 

ctgtgcccta 

accattttga 

ccctccttac 

tgacaccaat 

ccccctgtigg 

tagactgaga 

tgtctggcac 

tgagtgcaac 

gttatcccag 

ccagcctgac 

ggcacacgcc 

ggaggtgcag 

taagactcta 

gggtaaatga 

acctgtcttt 

ccagggggca 

tgcagaagga 

actctcccac 

ggtgtgtaca 

ctcccagccc 

gttgagagtg 

ccaggcaaag 

tgcccctggg 

caggtaggat 

acccttggct 

cacctccctg 

attgaaatcc 

ctgatctttc 

agaggagaat 

gataccagta 

atgggaccat 

cctaacaagg 

tacaccctgg 

tctttcttta 

taaaaacaaa 

atacatatag 

catttctacc 

cacaatgcag 

atcctgctca 

cgttcttatt 

cagagaggaa 

agacctggag 

agaaccgagg 

gaagtgatag 

ctccattaag 


tctctactaa 

gatgctgagg 

acgccactgc 

aagaaaagaa 

taatggctcc 

gactcgcaat 

ggggctgctg 

aaaaaaaaaa 

ggaaataccc 

atatggggaa 

gttctagatt 

tggatcctaa 

tgaacagtga 

atgattttat 

tatagagaaa 

gcaggtgtgg 

aatgttagtc 

actcctccaa 

cccttcttac 

gccagcccga 

ccccactgtg 

tcatccttca 

cccatcctca 

gtttgcttca 

acaccttgag 

catggctgac 

tagacagggt 

cattttggga 

atggtgaaac 

tgtaatccca 

gctgcaatga 

tcacaaaaaa 

agacctccct 

cttgcttcct 

cacagcactc 

agagagaggt 

cttacccttg 

gccctgctgg 

taccacatac 

ccaatgaaga 

cgccgagaga 

gtctctaatg 

ggggagtgtg 

cccagaactc 

ctttgtcatg 

gatctgtctc 

tcagcgagaa 

gagtgtggat 

tcctgagttt 

gaattcagtg 

cagaccggcc 

caagttcaat 

aaataaataa 

aaaatcaatt 

ctactaagat 

tgttttgtga 

tttgatgacc 

tctgctcctt 

tctactgtag 

ggaaaggaag 

aaatgggggc 

agtgggggga 

gatgcgaaat 

tgagagggtt 


aaatacaaaa 

caggagaatc 

actccagcct 

aatcatcctt 

taaaatcact 

gtctaaacca 

acaggagaac 

aaaaaaaatt 

aaggaaagga 

atgacccagt 

ttaaaagcta 

ttggaacaaa 

ctgcacagta 

ggtgatgttt 

tatttacgga 

gagaagcagg 

tacttttgtc 

ccaggtccaa 

ctctctctct 

ctcatctgcc 

ctcctgggcc 

ggtcttggct 

cctagaacag 

cggattataa 

aaaaagaaca 

acataactat 

taagaacaaa- 

ggctgaggcg 

cccgtctcta 

gctacttggg 

tctgagatca 

aaaaaaaaaa 

tcctggttcc 

tctggtactt 

cgaactgagt 

tggttgatga 

tctgcatgtc 

gaggctctgt 

ctcgtgccat 

ggtagtgcca 

aagcgtagct 

gggagaggag 

gaggcttcaa 

acccggcgtg 

ctggccctac 

tcacagcctg 

aggtgaaagg 

actctgttct 

ctttgggaaa 

ttctgttcct 

atgagtttag 

gtcctccctt 

ataaatacat 

ttattctgta 

ggttactaca 

caagagcagt 

tacagtcttc 

tttgtccttc 

ctagctgcgg 

cagtgataga 

actgtaccag 

agggaaggca 

agagaaaaga 

gggaagatgc 


attagccagg 

gcttgaacct 

aggcaacaca 

ttgcgatcct 

aagtatatgg 

gttgatcaat 

agaggaaacc 

aaactgcgtt 

atacttttaa 

ttcttcaaca 

tagaagacac 

ccaactgtaa 

gatgatatga 

aagtaaaaga 

tgaaatgata 

tgcatgggtg 

cctcctgacc 

ggatcaggcc 

ccagccactt 

cagattcctc 

actctccaag 

taggtgccat 

attatgtgcg 

ctgcctgact 

catctatctt 

cactcagtga 

agagaagacc 

ggcagatcac 

ctaaaaagac 

aggctgaggc 

caccactgca 

aaaaaaaaaa 

ctgaccagcc 

ctgtttaagc 

gaaggcaaag 

gaagtttcca 

cctcctcccc 

gttcccagct 

gtccccaccg 

caactgtatc 

ccagaaggag 

gatctgggcg 

tggaacattt 

ctctgcacct 

caccttggat 

gtcaatgaca 

aacgacaagg 

aacctgctcc 

tcttcctcta 

ctctcaaggt 

attaggatgg 

gattttggaa 

gttttggatc 

tatttaagat 

gttaagcaaa 

taaaatctac 

gtgctgtgca 

gacctgcatc 

tttgtgatgt 

gttgggacaa 

acttagtgca 

acccatggca 

gggacagaaa 

ctaaggccct 


tagtgggtgc 

gggaggcgga 

gtgagactct 

aatgaaataa 

ttgatgggaa 

cttaacatcg 

catagctcta 

tcctccaagc 

atgacacatt 

aataaacaag 

agcaaccaaa 

aaggatgtat 

agaaattatt 

ggccttattt 

cgatgtctga 

ggggaaggga 

cagcaggtta 

aaaactccct 

ccctaagatt 

aatgtttcac 

gccccgcctc 

tccctccagg 

cttctttgtg 

acctgccttt 

gtctgtcatt 

ctagtgtttt 

aggcatggtg 

ccgaggtcag 

aaaaattagt 

aggagaatct 

ctccagcctg 

gagcgagaga 

cctgccctcc 

cggtccatga 

gcatttgtgt 

aaactccctt 

acaccatcag 

gggacatgca 

gccatgttgg 

acagacagcc 

cggagttcct 

tgaattacga 

cagatccggg 

gcctcggact 

gaccctctgt 

cttcttgcag 

agaggagaaa 

tcagcacctc 

ccctaatctt 

taaccaatga 

acttaatcta 

gcttcccaga 

caattgtcag 

atacaatatg 

ttaacatatc 

ttgtttagga 

ttagatctct 

tcccatctcc 

gtgtaaccaa 

taagagaggg 

atggcatcac 

ggcgggcttc 

agagacgaaa 

ttttctgtcc 
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tgcctttcct 

aaggagaggc 

tgaaaggaac 

ctttgtggtg 

aggactgtga 

ccaccgcact 

ttaaatcaga 

ctagtgattc 

ttcaaatccc 

gagcctcatt 

gatgaaatca 

ttcctctact 

atttccccca 

ccttgtgaac 

aaatctcaag 

atgagcccac 

ccaaacctat 

atcagatgta 

agccagctac 

tttgagatag 

tcctgtgata 

cccacaatca 

gtgcccccgg 

gaagatgaag 

gaaaaagtca 

aagcaaacta 

atggaatatt 

tcccagcact 

catagtgaga 

catacacctg 

gtttgaggct 

gaccttgcct 

catggatgaa 

tagtgcttta 

gggaaacagc 

gtgcacgcct 

tacatgcctg 

agttccagac 

agtggggcat 

tcgcttgaac 

etgggGaaca 

tgttatgtat 

gttttctaag 

cctggtacac 

acgacccatc 

cagtgccttt 

aaattcatct 

cttggtgact 

accaccgacc 

gtgcagtgaa 

gaattcctgt 

catggtggct 

aggtcaagag 

caaaaataag 

ggagaattgc 

tccagcctgg 

aggaaagaaa 

gtgactctta 

ccctccaaac 

agagatcctc 

ctaagcccag 

cactcagcca 

catggttgcc 

ttgtttttac 


gattctgggt 

ttgggtctgg 

tcctttgact 

ccattcccag 

gccgaacctg 

ttggaatcac 

atgtctggag 

taataagcag 

agctctgcct 

ttttgcattt 

gagagcacat 

ccttgtggca 

atttgaatga 

tgcttcagcc 

aacgcttgtg 

gctggcttgc 

gtgagtgagg 

tgaatgagcc 

ccacagactt 

tttgttatgc 

ggtttgcttt 

ccagatagtc 

ggcatcagga 

gaggccctga 

gtgtcacacc 

agggtccatc 

attcagcctt 

ttgggacgca 

ccctcacccc 

tagtgccagc 

gcagtgagtc 

ccaaataaaa 

ctttaagagc 

gaagaccaag 

aagacctcat 

gtaattccag 

taatcccagt 

cagcctggcc 

ggtggcacgc 

ccgggaggcg 

gagcgagact 

attttattgc 

gagtcctagt 

acatgccccc 

cccacagccg 

tttcccccaa 

tacaaaacac 

ggtatgacat 

tagcccaacc 

ttgctcaata 

tcctgctccc 

catgcctgta 

ttcaagacca 

ctgactgtgg 

ttaaacccac 

gagacaaagt 

gatgaggcag 

tgccaggagt 

ctccaaagct 

ttctctgccc 

agagaagtcc 

tctgtggtgt 

acaactccag 

tctgctttca 


ccctggggga 

aggagagatg 

gtagctccct 

ggcacaggct 

agactcagac 

cttttagcca 
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caaagtttag 

cttcctagca 

agaaaatgga 

ggggtctgac 

aatgactgca 

gactgcagct 

aaaagaatgc 

catttctgca 

tggaggatga 

ccatcctagc 

ctgtcaaaat 

gtgaaaaata 

agcaatagct 

accctgtcca 

cttacccata 

ggaaagtgcg 

ggctgatgcc 

cacgttcata 

agcagatgaa 

aaaaaaggaa 

cgaggatcac 

ttctctagaa 

tactcaggag 

atgactgggc 

aaaaaaaagg 

actatgcagg 

acaggaggat 

ctctactaaa 

ctacttggga 

actttgggag 

aacatggtga 

acctgtaatc 

gaggttgcag 

ctgtctcaaa 

atacaaaaac 

acaagccatc 

acccacccca 

gtggagcctc 

cgacagcaaa 

accaatgggg 

caaagagaat 

tcctcccatt 

aattgacaga 

tttccccttc 

atttcagcac 

gcctggccaa 

tggcacgcct 

taggcagaga 

gagactccaa 

ccatctgggt 

agaaaggctt 

tatgggacta 

ccataacctg 

caggttggat 

gctggcttta 

ctatcacatc 
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ggttagccag 

gttttcatgc 
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gataaggata 

aattagtaag 

aaaatgatca 

ctatttcccc 

agcagaagtg 

ctctttcaga 

cagccacgtg 

atagccagcc 

catctacatc 

ataaatgctt 

aacagatgca 

tcccacccta 

gctgtagttg 

tgctacatgc 

tttttcataa 

gcagcactat 

cagctaaaca 

agaggcaacc 

ctgagcccag 

aatttttatt 

gctgagtggg 

cactgcaccc 

aaaggaaaga 

gccaagctca 

tgcttgagtc 

aataaataaa 

ggctgaggtg 

gctgaggcaa 

aaccccgtct 

ccagctactc 

tgagccaaga 
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agacacaccc 

ctcgtgtggc 

ggccttttgt 

agcttgccag 
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ttacagagag 

gtaagcagca 
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tttgggaggt 
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ttgcagtgag 
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gctacctgga 

tttaagagga 
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tctttggctg 
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ggaggaattt 

tctgaaacgt 

gtcttttctt 

acaaacaggg 

ttaatgcatg 

gtgtggaagt 

gttaatctgc 

tgaagcacgg 

cttgactctt 

cactaacaca 

cccatgttct 

tcctgcgtgg 

ggtctgacgg 

acaccttttt 

tgtgccaagt 

ctcttgtggg 

aggcagatgc 

ggaaccagct 

tctaagtgag 

cccttaatat 

cacagaggca 

gttcccactg 

tggtaaggac 

agggagcata 

tttccttccc 

gacagagtgg 

tataagtgag 

ctcaagagca 

tcctctgaac 

tcctaccctc 

ataaagggct 

tcacaatggg 

ttctacagcc 

cagagaaaat 

agccaattac 

tatggattaa 

tactgttgtt 

tagggagaat 

aagtaatggg 

tagatgacac 

gcttggattg 

gatggagtct 

gagggtgggg 

tttctcttta 

cactctggcc 

cagcctggat 

actggaagca 

cttatgaggc 

caactgttta 

cactgacaat 

ggaggcacgt 

tgagcactga 

cgataggtgt 

aactccccca 

ctgaaactaa 

tcaccaaggc 

tcaagtgatt 


tcccctgagg 

agaaatagag 

gcacaaaaga 

tggggtcrctg 

ccccacctgt 

taagatgggt 

agattttcta 

caaaggaact 

atgctctcat 

aaaataacat 

cttaagatgc 

ccctatcttc 

agcaagggag 

aggagcctgc 

gacacggggc 

agagcctcca 

agtcttactg 

ttggttcgaa 

ctccaatgag 

tgcacccgat 

ttttttaatt 

gccgggattc 

agagacagac 

tagcacagca 

agagcacagg 

ggcattaaga 

ctctcctatc 

tcatctgctt 

gatggcttca 

cagtccaggg 

gaactggatc 

atcatcagca 

gaagatggat 

ctgcactcca 

ccaccttcca 

aacctcagcg 

tgacatcact 

ttgatttcgg 

ctcatgtaaa 

tgttttggaa 

tctaatcctc 

tggtatctct 

agataatgta 

aacaaatatt 
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tgagctcagt 
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cctcagagca 

cagcctccct 

ctgggccagg 

tactgctacc 

ctggtaactg 

atctaaacct 

cctacccctg 

ctatgtatct 

tgttattagc 

acatcaccag 

tgcttttttt 

tggagtgaaa 

ctcctgcctc 


gcccaggaac 

gttacgatag 

taaaagtctg 

taccccaagc 

aagacgagga 

gtatgaggtc 

gtctgcagtt 

tgcccacggt 

caccccgtct 

tatttatggg 

aagattttct 

tgctctggca 

aaacagaaaa 

cagaaagatc 
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ctccacatct 

aactgatggg 

ttccagcagc 

gattaatgag 

gtcttcatcc 

gactgattgg 

gtagagaaga 

aaagaaaccc 

cattgaaagg 

gtggtgaaga 

attcagccca 

tatccgtggc 

ccaagccatc 

cttaactgct 

tctggggatg 

tcctgcatcc 

agtctggata 

atgtttaggt 

tcattcccct 

atgcagtccc 

agggcaattg 

ttctaaattt 
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ttaaaatgtt 

gtcagaaagc 

ttgaggcagt 

gagcttccgt 

tgaaaactgc 

attgacttac 

tcctaaccta 

ttaccttgct 

tcagtacaac 

tgtgggaagt 

tcttcagggt 

tagccaagta 

atctgaaagg 

gccagctcag 

ttcaccatgc 

cctaagagaa 

caccctccca 

cacttcccga 

gcaactctaa 
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gattcgaagc 

atctattatc 

gtagcagtgt 

tttttttttt 

tggtgtgatc 

agcctcccga 
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gggacagcca 

caaccaccct 

tactcactag 
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atgtagacag 

cattctacaa 
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attttactgc 

tacattttag 
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tagctcatgg 

gggttctgtc 

cccatcaatg 

acaggaaatt 
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ttttctcttc 
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aagtggaaag 
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gctttttgct 
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caggactggg 
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tcagttcact 

gtagctggga 
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agtgacttga 
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agctgcagta 
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aaagagacag 

gctatctgta 
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agaatatcct 

agatgctatt 

gagagaagtg 
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tgcaagagtt 
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gacattgctt 
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gcccagctaa 

gtctcaaact 

caggcgtgag 

cctgggaggc 

accagttatt 

aacaagagag 

accacaccca 

actcctgagt 

tgaaaagagc 

caaagggaag 

tagggattag 

gagagcacat 

caagtctgtg 

ggaagtggga 

gcacctttct 

tagccctgac 

gtcgtggggt 

atggtgagtc 

aaaccacgga 

tgggctgcaa 

aaagcattct 

acaaaattgc 

ttctcaatga 

tgagtgctgc 

tgatgcccac 

tgaagaaact 

ggaagagata 

cttttcctgt 

acagaagtcc 

gacatctctg 

ttgttgtgat 

gataaacctg 

gaatcctaag 

gaaatcaata 

ttgacacata 

atatccaagg 

gatcctatga 

ttccttgttt 

tgagacccac 

tcacttctgt 

aaaacatgga 

ctctgcaacc 

tagcacccac 

gaacagtgtc 

nnnnnnnnnn 

nnnnnnnnnc 

ggtctcaaac 

acaggcatga 

cgggaagtgc 

ggattctcca 

aggacagtct 

accagagtcc 

ttccagagat 

cccacaagct 

taacttggta 

gctcacatct 

gagttcgaaa 

ttagccaggt 

aattgcttga 

gggtaggcaa 

ctttattctt 

tgataatatt 

ctatccatct 

aatgagaatt 


tttttttgca 

cctgggctca 

ccaccatgcc 

ttcccaagag 

actaatagga 

tagtgctgcc 

cccacacgca 

gaaaacaacg 

aggaaactgc 

ccggaagaga 

ctggagggag 

gtaggatggg 

gtgaaattag 

gacccagggt 

ctcttggggc 

ttgctgaaat 

gctgcaatgt 

cgccctggga 

agttcccggg 

aaccgcaaga 

gccctccagg 

ctcaggggca 

aagcaccctt 

ttttgggaca 

agccacccca 

gaggcctcaa 

gacttcccag 

gccaatgtga 

tctctgtgtc 

ttatactact 

taagtatagg 

aaatgcaatt 

gctgactggt 

ccagaaggtt 

ggtgaagctt 

cagggagcca 

agatcttacc 

gcttagatta 

tgaagaggaa 

gttgtcactg 

cttggaagcc 

tcgggcagat 

ctcaacatct 

tgccatctgg 

nnnnnnnnnn 

taatttttgt 

tcccgacctc 

gccaccacgc 

ctgtggggct 

tgacctctgt 

gatggctggg 

ttccgataca 

atccatgcat 

tttgacatca 

tctctggtac 

gtaatcccag 

ccagcctggc 

gtggtggcag 

acctgggagg 

caaagcgaga 

tttgatggct 

taagttatgt 

gtaaatgtct 

tctgggtcaa 


ttttttgtag 

agtgttctgc 

cagccagctg 

ctacacagag 

atgaagatgg 

tctctgtgat 

tgatcatgaa 

gggtttttca 

tgggg cc gat 

ctggaacagt 

aaggagaagc 

aaccccagat 

ggaaggctcc 

ccagtcctgg 

ctccattcct 

gctgggactc 

tcctccagat 

gcctctgctt 

gctgatgggt 

tgggctgtga 

tcttaaagca 

tcagggaggc 

ttgggtatta 

ggcattgggc 

taaggaggcg 

gagatgaagt 

tatccagagc 

tctgcctcca 

ctcttcagcc 

ccaatggtgt 

ggaaatgacc 

tattaattaa 

cagcacaatc 

ctttgttgag 

gacatacctc 

tctgggtgtc 

tttctggctg 

atttctgcgt 

taatcaatac 

tatcctcatg 

agaccgtctg 

tacctaagtc 

gtcaagagga 

taagcagtcn 

nnnnnnnnnn 

atttttggta 

aggtgatcca 

caggccccac 

ggtgtggaca 

tactcatatt 

caaaccctgc 

tgattctggg 

gcatcggcat 

atagtagcat 

tgctcaatat 

cattttggga 

catcatggtg 

gcgcctgtaa 

cagatgttgc 

ctctgtctca 

gtaggtggat 

cttcagcatc 

ttttgaacat 

agaggatatg 


agatggggtt 

ctgccttggc 

atgctcttaa 

catgagttct 

gtaattcttt 

tctgtgtctc 

aagaggaaat 

cattgagagc 

tgaacactgg 

ttccatggtg 

tggggtggga 

gatactcaag 

actgcggact 

ctctggagcc 

acctctgtga 

tgtacagagg 

gctgcacggg 

ggaagctgaa 

tcttatagat 

ccactctcaa 

aacacagact 

agcaggcctc 

ataatgacaa 

taagagctat 

gaggtactat 

aacttggcca 

ccatgttttc 

ggaatcctgt 

tgatagtata 

tccctcccct 

cacactaaac 

cactgagaaa 

tctttcagga 

tacaaagtca 

tataaagcct 

ctctcctttg 

catttatcat 

atttaataga 

atactagttg 

cttccttata 

ggtttgaatc 

agtttcccct 

ttcaatgagg 

nnnnnnnnnn 

nnnnnnnnnn 

gagacagggt 

cccgcctcgg 

acacctttta 

tgtgggtgaa 

cccacactcc 

ggcaaaccat 

cagctgttgt 

atgtgtataa 

attattaaat 

aagaacatat 

ggctgagatg 

aaacccccat 

tcccagctac 

agtaagccga 

ataaaaaaaa 

gttctaaaat 

atatgaaact 

ttctagaata 

cattttacat 


tcgccatgtt 

cccacaaagt 

tctgtgccct 

ggaatcgggt 

cagacagcac 

cctgccctgc 

ggatccagga 

tttgcccaac 

acttttgttg 

ctggaggatg 

ggggaacctc 

gcatggcatt 

gtagacagag 

ccctgggtgg 

agcgagtgct 

ctgacattaa 

agagggcaga 

gtggcctgag 

tgtacatgca 

ggaaagagcc 

caatccttat 

aaatgtgtgc 

cagtaatgac 

atgtaatata 

cattatgcca 

agtcactcag 

accattatgc 

cttgatgttc 

tcttttcata 

acccctccct 

aaactcataa 

tgaaaccacc 

aggacaggct 

gagggaaggg 

ccatcctgcc 

gacagtgctg 

gattgtggaa 

actgaaaggc 

tgttgccctt 

atggagggac 

ctggctctgt 

tctctgaatt 

gaatacacat 
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tttgccatgt 

cctcccaaag 

aacaaccaga 

ggtggcac:tg 

tcaaattagc 

tccccagccc 

tacccgtgtc 

ttattatatc 

tgttctgtac 

agacctggcc 

ggtggatcac 

ttctactaaa 

ttgggaggct 

gatcacgcca 

aaaaggaatg 

ttgtgtaacc 
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aaaggcctat 

agtgactcag 
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tccttttaag 

ctttctagaa 

agtcatttac 

tattattatt 
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ccagtaaatg 

tgaagtacct 

ccttccccat 

ccattctttg 

gggagcttag 

gagactgatt 

cagcagatgg 

tttgggaaag 

agttgatgga 

aaggatcaga 

gatttttctg 
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aatttcccat 

tgcagagaat 

agagatggta 
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ggggatataa 

aaagtgctca 

nnnnnnnnnn 

nnnnnnnnnn 

tggccaagct 

tgctgggatt 

tttcattcat 
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ctgagtctcg 

tgccctctca 
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tatatttcat 
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aggcacagtg 

ttgaggtcag 
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gaggcaggag 
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tatagacctt 
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tgtcttccaa 

tttccccaca 

gaagctgttg 

agagaggtta 

cgatagcaga 

gaatggagga 

ccaccctgaa 

agtatttgga 

gaacattata 

atataatgta 

tttggcaggc 

catagcgaga 

atgcatgctt 

gagtttgagg 

aggaagacca 

aaaagcagag 

ctgttactta 

aacatataaa 

gagataatgt 

ggaaattcat 

ctttgatctg 

aggtataggg 

accaggcagg 

cccaaagggg 

tttatcgcgc 

gcttttacaa 

tttaaaatta 

tagcgataca 

tcagctcaaa 

gtaaaattag 

gataatgttt 

gaggatgatc 

caggctggct 

actgcacact 

agggtatatg 

acaagatttt 

ctaggaggac 

gtctcaaaaa 

acgtgctcac 

tcatcacagt 

tttgttttta 

tcttcaagta 

catattctcc 

tgcaaattct 

tttctgcata 

aactgcagtt 

gcctggagct 

ttagtcagtc 

tccagagttt 

agtgctgggt 

cacacggctt 

tgtttactct 

ctgttgaagt 

ttaattgctt 

aactacaccc 

gtcactggcc 

tcccccacta 

ccatttgccc 

agtggcttca 

cagaaatgag 

ggtgcctata 

cagagtctct 

tcacggccac 

ttacccactc 


agtggtcgta 

ccctggagag 

gccgttatat 

gttagtgggt 

ggagggtgac 

cttaaaatat 

tgtgccagat 

tgggagaaat 

cattctatat 

tcagtaaaga 

cagggtggga 

ctctgtctcc 

atagtcccag 

ctgcagtgag 

tgtttctaaa 

tggtaagaga 

cccatcgtgt 

atagaagtaa 

atataaagct 

tgttgtagtg 

gatttggctc 

tggtctgagc 

tgctgataag 

ggtgtacaaa 

actgaaaaac 

aaaatgctaa 

ctaagtcata 

ttaatgactg 

ttctgcaatt 

gcaggagggc 

gtttttatct 

tggctgtgac 

gggcaggtgt 

tggtcaaaga 

agcacctgtt 

attgtgcatg 

tgcttgagcc 

aaagaaagaa 

ttctctcttt 

gagcacaggc 

cagaactcat 

aaaggaaaac 

agactttgtc 

tttatcctga 

ttctttataa 

ttcttaacca 

tttttttgtc 

aaaaagtaat 

tgcacactgc 

gatggtgttt 

tcatttctat 

ctgcataccc 

cttgaggctt 

ccagccagat 

tggacagatt 

tactgtgttt 

ccagtagcag 

cacctcctac 

acccacagaa 

aaaagcgttg 

aggaaatggg 

tgccccatga 

tctaggcagg 

tgctgaacac 


ccaattaaca 

atgaaaaatt 

aaccctcgtc 

gcaaacatac 

tatagttaac 

tccaacacat 

cttgtttgtt 

gataaatgct 

atgtaacaaa 

gagggctggg 

ggatagcttg 

acaaaaaata 

ctacttggga 

cctacgactg 

agaaataaat 

aaggactttg 

gataagcaaa 

aaatcatacc 

tctggaataa 

ggcagccttc 

aggcaagacc 

ttccccagca 

accggaatgg 

acgatctcgt 

aaaaccaaca 

agtaggcatt 

gtacataatg 

aagttcttta 

acacaggctg 

taacctcgct 

cacagagttg 

acctgtcacc 

cccctttctc 

ggaagacctt 

ctgtcctgcc 

gtagcatgag 

caggagtttg 

agaaagaaaa 

ggaatgtagt 

taaagcatct 

tatgttgtga 

cacccatcat 

aacatgaata 

cttttataaa 

ttatcatttt 

tttcactgtc 

tattgcatta 

aatattttca 

tcccagagat 

ataaacttct 

ctctttgatc 

tcttgggtga 

gttttataaa 

ttggccactg 

aaattaattg 

gatcctatag 

gaggctccat 

aataccccac 

tttaagggga 

ttcacatggg 

ggagggttgg 

tagagggttc 

gcttcccaat 

agatgttacc 


ccccgacctg 

tatgggccca 

attaataagc 

aattagatag 

aacaatgtat 

agaaataata 

cttggaagct 

tgaggtgata 

atatcacacg 

cacggtggct 

aggccaggag 

aaaataaaaa 

ggctgatgca 

caccactgca 

taaataaaat 

gggctcaaca 

tgccttaacc 

cacttcaagg 

tgcctggcac 

tgaatctgtg 

tggggaaggg 

gagtgaggct 

gaggctggag 

atgactcctt 

tatttaatga 

cacatgttta 

tgtgagccac 

aacatcaaca 

gggttgaaac 

gaatctcaga 

ttgtgaagat 

ccactgatct 

cctcaccact 

tcctgataga 

agaatctccg 

cctgtaatcc 

agaccagcct 

gaataataat 

aagtgtacct 

tgactttatt 

aaataatttt 

tctcccaacc 

cttactttac 

caaatatgat 

gtggctgcat 

tggggaaatg 

tattcttaag 

aggctcttgt 

gtaggaacac 

gctaatttaa 

attaacaggt 

agtagtcatc 

tgtgagcgag 

aggctcctiga 

gagaaaatgg 

tgggggcctg 

gtgtccccca 

atacatgtac 

aaggaattgt 

atcacctaat 

agagttgtgc 

tcaaattaag 

gcttccccaa 

catagcacct 


taatgaatga 

ctttggagtg 

ctgggggtgg 

aagtaataag 

tgtatatttc 

aatgcttgtc 

aagcagggtt 

gatatcctaa 

tatcccataa 

cacatctgta 

ttcaagatca 

cgaattagcc 

ggaggattgc 

ctctccagcc 

aaataaaaat 

gtactagcct 

cctgtgtgcc 

gtcattataa 

acagtaggag 

tcctctttgt 

cagagactga 

gggaaaggtc 

cataaggcag 

tatactgtta 

atgattccaa 

aaaattgagt 

agctatcccc 

tacaatgcca 

ccagcttttt 

tgtctagtct 

tcaataaaat 

ccagagttga 

ccgcatgcat 

ggaggaccat 

aaggagctct 

cagctactca 

gcgcaacata 

agtaataaat 

aataaatgtg 

ctataagcaa 

ccaacattaa 

ttcaataatt 

atggtcgcaa 

gttataaacc 

aatattgcat 

gaggataatg 

atcaaatccc 

tatattttac 

agacgtcatc 

taagtatgaa 

tgaactattt 

cacttccttc 

cacttcagag 

gcaggggaat 

gctgagaggc 

aactggggca 

tattagagct 

tcactctccc 

tctgtcctgt 

gaagggatgc 

aaaatgcaac 
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atcattgtaa 

taaaagagga 
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ttcaattttg 

tcagtgttca 

ggtctccatg 

tgactatgtt 
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agcagtgaga 

tagattgttt 

caaccttgcc 

atgctatcct 
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agcctggctt 

ccagctgccc 

acagcctggg 

cctagcctcc 

ctaggttcaa 
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cacttgagga 
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tgaggcagga 

attgcactcc 

gcagcttccc 

atgcacatga 

ccccttggcc 

agcccacaaa 

ctaagtgtgg 

cttcttgccc 

acagggctca 

ctgtttctcc 

atcagccagc 

tataaataat 

aggaggatgg 
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gctgaggcag 

ccattgcatt 

catacataca 

accctaaaaa 

aatttaggtt 

ccaggcagaa 

gtccctccac 

gcctggtgac 

aatctgattt 

tgggatctag 

ctggatgata 

ccttcaatga 

tcggaggcag 

cacagtgggc 

tagtacacgt 

cactccttat 

tgggaggccta 

tagtgagacc 

agctacttgg 

agccaggatc 

gtctcaaaat 

caatgcctat 

accaaataca 

aaacaactct 

ttatctatgt 

agtcaccagg 

tttattgatt 

tcgacagggc 

ttgctgggat 

actggggttt 

gagccagggc 

acactaaccg 

ggaggttctg 

cattgagagt 

gtgccaataa 

atggggtggg 

tctccagaga 


acagtagact 

gtgccaggtg 

cagagtcaag 

actcaacttt 

ggtgtggagc 

tgcagctttt 

ctaacacctt 

aggcaccacc 

caactgcaag 

atttagaatg 

gtggctcacg 

cagaagttcg 

attagccagg 

gaattgcttg 

agcctgggca 

tccacttccc 

atatccaacc 

accctttggc 

gggccgggaa 

tctagaagag 

ccatggaggg 

gggaaggact 

atctgtaaga 

acacgtaagt 

aggccaggcg 

atcacaaggt 

aaattggccg 

gagaattgct 

ccagcctggg 

tacatacata 

tgagacaggg 

ttcttgcctg 

caaggtgtgg 

ctcccagcac 

cagtgggaga 

tcccccaggg 

ctgagaggct 

cgaagatgta 

tgggacactg 

agaccaggct 

agccctagca 

gggtaggaag 

gagaccacag 

aggcaggcag 

cccacctcta 

gaggctgagg 

atgccactgc 

aaataaataa 

ttctccaggt 

gggtcagcta 

atgagctggg 

atttatctat 

gatggcctca 

gattgattga 

tggtcttgaa 

taccaacacg 
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ccaagtgttc 
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tcttggcttc 
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gcaggggcag 
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agaccagcct 

catggtggta 

aacccaggag 

acaagagcga 

aaccacagct 

aacatgtcta 
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agatgcagct 

aaactggcta 

ctttttgacc 

aggggaczatt 

tcatgacatg 

tggtggctca 

caggagtttg 

ggtgtggtgg 

tgaacccggg 

tgacagagca 

catacataca 

aaagagaaaa 

ttttagaaag 

aactcactgt 

agtaggcagc 

gctgaggagc 

cccacatcac 

ccatttttgg 

gctttgcagg 

aagcccacag 

agaactcagg 

agtgccaggt 
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gtgctgggtg 

attgcttgat 
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tgggaggact 

acaccagcct 

atgacagcag 

agtcactagg 

ctctgttatg 

aacaagtatc 

ctattcattt 

aacttgagct 

ttgattgact 
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agccaccatg 

gcgacttgcc 

caaagcctgt 
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ccgtggggct 
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ggctggagtg 

aagccgggca 

gcacttggaa 

ttgaagttct 

cagcactttg 
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cacttgtaac 

agatcagcct 
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ggaagggttc 
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tgcagtgagc 
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acaaccttgc 
ggagccctgt 
tgggtgacaa 
attgactggc 
gtctgaagga 
ctgggaaagt 
tttgggagtc 
acatggagtc 
tgtcccagtt 
tggaccagga 
tgtgaaggtc 
ccagacctgg 
cacctgcaat 
cgagaccagc 
tggtgtctgc 
gggaggcgga 
gtgagacaga 
ttctttctgc 
aaataatatt 
ctttgtttct 
ctcccatctt 
tttgagacaa 
gtcaccaccc 
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tccctagcta 
attgagccac 
gaggtgaagg 
aagggggatc 
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ctagagccac 
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tcaggagcag 

atccctagaa 
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acagagacac 

caagatattc 

actcagccat 

ttccttttac 

atccgagggg 

ctctgccact 
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ctgggaaaca 

ctgtagtccc 

ggttgcggtg 

atgacacact 
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atcagcattt 

tttaagcctc 

atttatttat 

ggtccttcta 

cccaatttct 

tctcgctaag 
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gccagcactg 

ccagtgtcca 

agcccaactc 

agaattactg 

acctcatttt 

cgatgaccgg 

attgtctagg 

ctccccccaa 

gtaagagggg 

atgagaccag 

gctggaaaca 

tgggggtgga 

gcggggatga 

ctttcctcgc 

cttgggggcc 

gacagcacac 

gtgatatttc 

acccagcgac 

acccccaaca 

tcggggtggg 

ctgggtgagg 

acgagagggg 

ccgaaaatgg 

ggaatgaaac 

acgagagaca 

gacagtgaga 

gagagagagt 

cgggggagaa 

agcagagaat 

cactgcattg 

aggggcttct 

cacctgcggc 

gctcccccga 

gagtcaagac 

ctgctgatac 

tccgcccctc 

gctttgggtc 

acatacacaa 

cactaagggg 

gcgcaaaacc 

ttcagcggct 

caaagggtaa 

cacgctcata 

gcacactggg 

ggacagaaat 

gggtgacc ca 

cggcgttcct 

gcgaggacga 

gagggggtta 

cagcaaacga 

cctcgctgtc 

gactctcgag 

cagccttccc 

ggcccgcagc 

tcccgggccc 

gagtgggagc 

gggctcaggc 

gggagagaag 

ggtaacctgg 

tctcagcagg 

ggtgggacgc 

tgtccccgct 

actcagcaac 

atgtgggtag 


cggccaaggc 

agagtccagc 

atgttcaggc 

ccaaaatagt 

accaagcact 

gtagcctcac 

gtcacctgga 

ccctaaccgg 

tcagagtgct 

ccagccaccc 

cgacaatgga 

gggggtgagt 

gcagaggggc 

agctcgtccc 

cggggtcc c c 

ctgagtcagc 

gctccccggg 

agattgtgcc 

ctccctcccc 

tcctggggaa 

gaagacggga 

cgggaaagga 

aaccagcagg 

tggggaagag 

gagacgcagg 

aagacagaag 

gagggaggga 

aaccagagaa 

gaatgggaag 

gaccctggca 

gtgctcgcgg 

tccagcagcc 

gggccccgag 

ctgggtcaac 

cgcctgtgac 

tttaaaaggg 

ctctcctgca 

aaaaacgcac 

catacacaca 

tcgcacagcc 

cacccgtgtg 

cgggcaagca 

tgcactcgat 

cttgaggtct 

gctggaggct 

gaagctcatc 

ctaaactacc 

tcgaacacag 

gaaagccgct 

gggaggagcc 

cccactctcc 

cggacagcgc 

cgccccgtcg 

ccgggcggcg 

tcccggagcc 

gccctccccc 

gcctgctgca 

cagggcgtcc 

ggaaggggaa 

agggcacttg 

agggggcagg 

ccacctgctc 

tcctcctgcc 

ggggctgtct 


ttgaagaccc 

cagaggccga 

atgatgtgtc 

aacgacatta 

ttctcatgcc 

cgtacagatg 

gaactctgat 

agctaggtgg 

ctacagagaa 

acccaccagc 

tgacacgagt 

cagcagataa 

tgcgactgcc 

aggaggagga 

tcccacccct 

ccgccgccca 

agccagcccc 

gcggctcatt 

gccgccgcct 

acgcagggtc 

gccccagaga 

aatttagggg 

gggcacccga 

ggggagagaa 

gagccccgcg 

accgggcagg 

gggaacagag 

atcgaatgag 

aataagacca 

cggataggaa 

gagcggcagc 

ccaaccccgc 

cctatcctac 

cgccctgcag 

caggccatga 

ggtagaggaa 

cccccgcggg 

acactcgcac 

ccgggcacat 

tcacgtttcc 

cacactcacg 

tcctgagtca 

cttgcacgca 

gggagtggaa 

gggacactgg 

ttctcctgga 

gcattccccc 

tcctccgggt 

cccgcctcct 

agcgagtgcg 

ctcggctagc 

agctagcggg 

cgccccgccc 

cagggtagag 

ccgcggggtc 

cgctgccccc 

aggtaagaac 

cctctttcag 

aagctcagcg 

gctgggagcc 

tggacccggc 

cgggggacgc 

cggtgcctca 

ggggaggtag 


agcacaccaa 

gtcctcgatc 

tgattctact 

gctacctacc 

tggggtgcct 

aggaaactga 

ctccagactc 

ggtggggaca 

gaccaaatgc 

cagccaccta 

tccctgccct 

ttatgggaaa 

tggagccagg 

gctccccccc 

ccccgaagag 

cccgcccctc 

actgcgctcc 

ccggggaagg 

ccaggccctc 

ctgtcctgcc 

cttttctttc 

cacagagagg 

gagccgaggt 

agggaggcag 

gggaggagga 

gaaacagacg 

agacagagac- 

aacgcgagaa 

acatttatca* 

aggaggagcc 

ccagggggct 

cagcgctgcc 

gccggggcgg 

cctttgtagg 

agggccagag 

gttcacgcga 

agataaggtc 

gcgcgcccat 

atttctttcc 

caccagctca 

tgcccccccc 

cacctgcaca 

caaactcttg 

ggaaaagtgg 

cgcgagggac 

aagttgggag 

aagaagggat 

cgcttaagcg 

agtggtcgag 

ggaaggagtg 

agcctgggca 

gcgcgggcgc 

cgtcccgtcg 

cgccgcggcc 

cccgccgtgc 

tcccccgagc 

gccagcggcg 

ggattgaggg 

ctggggccgc 

cgcgggcgcg 

ccggagcggg 

tgaggactcg 

gcactttctg 

gaggcgcaga 


agcccggcca 

tcaaaatgtc 

gggacaatca 

acccctccac 

ggagacttaa 

ggcacaagga 

aaatttccaa 

gcaaatgtgg 

attgtggcac 

aaagtcttca 

caaaaagctg 

ccgtgacacc 

gattcccgga 

agcttcgggg 

cgcgggc ccc 

agcgtctgtc 

ggaggcagct 

acgccaaacc 

cccccaggcg 

tcctggaaat 

tgtttctacc 

agctgggggg 

gcccacgggc 

agacaccgag 

gagagacgaa 

agtagagaca 

cacgaaatat 

gaacgagaga 

agagccgact 

gcggcgcggg 

cagcagcccc 

tggccaccgt 

ctccgcggac 

gaagtgccta 

gggctccagt 

agccaacagt 

tcccctcccg 

ctcgcacccg 

acccatcccc 

gacatgcacg 

cccgcttccc 

agcatccttg 
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aatcttggag 

gcggctgggg 

gggggaacag 

ttctctagaa 

ggggggaggg 

aaagggttaa 

ggggtggttg 

cacggacaga 

tgggcgtcga 

gggccgatgg 

cggccacgca 

atccggcggg 

atcgagacaa 

ggagagcgga 

tggggcagtt 
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tgcgaggagc 

gagggaggct 

ggccggctgg 
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gggaaatcca 


agcctccagc 
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tctggtagga 

aagggtgtgg 

aggagaactc 
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ggtgaagacc 

agactatgga 

gggaccctgg 

gggatggtga 

atggatggga 

agggactggg 

tccacgtggg 

gtggctggtg 

gacagagggt 

gctcctgtgc 

accacggtgg 

cagctctaaa 
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tgcccgcagg 

gtctgggacc 

atcctttcct 
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cggcccaaag 

ttgatagcca 

cttcagcctt 

ttctggccat 

tctagaacaa 

ttccttgcct 

gcacctgggt 

cgtggtttct 

tgggcttgat 

ggccactgtg 

ttgacctcca 

tgcagtgcct 

ctgcagctct 

aggctgctac 

ctgagaaaga 

aactcacact 

taggtggcca 

ctccaccggg 
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gggaagggac 

ggggagagag 

ccctggagtg 
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gttacagctg 
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tccctgccaa 

tcatcacctc 

gaaatctcca 

tgtgcttcca 

accccacgtg 

aggggtgaga 

tgtcactgag 

acgcggagga 

gacactcacg 

ccgcacctgt 


gagatggagg 

gccagagact 

aaagatggag 

gagaaggaat 

aaggaagagg 

cagggaccca 

ggactggaat 

ctagagaagg 

gaggggctgg 

gattgggggt 

aagccctcca 

ggccagatgg 

gaggactttg 

tgccatttat 

ctgcttattt 

cctccgtcac 

ccatcccact 

ccccagccct 

ctgtgcccct 

gggaccactc 

ctttgggtcc 

agtggcctgt 

gagtcctgga 

ccgggatcaa 

atattctgga 

ccgcaactgg 

tgaccctgaa 

gcaatttctc 

gcggaggatg 

gtccctcttt 

tcatgtttct 

ggggattatg 

aggacaaaat 

agccatcccc 

aaagggcttt 

caatgaaggc 

tcccctcccc 

ccgaggtatc 

aattcacaaa 

gcctggaggg 

gtggcaagaa 

ggaaaggcac 

ctcgctccag 

tccttgcctc 

ctttgcacgg 

ttatggagga 

gggctggggg 

ggaaagagaa 

aaagaggagt 

cctgatgcca 

tggggcagcg 

ccatcagggc 

aggatttggc 

acaggaaggg 

cactttgctg 

agtagaactg 

gttctaagcc 

cactccacct 

ggtctaagaa 

agggccgaga 

tattacatca 

cagccagcac 

gtggggccca 

gtgctccccc 


gcgctagaaa 

gggggtgggg 

aaagtggtga 

aaaggttaga 

gaaggagagg 

gaatcctggg 

ggaaaaggag 

ggtatcagga 

agaggaggga 

gctgcccagg 

ttcccaaagc 

ggggtgcttt 

gaaggggagt 

ttatccggcc 

tggggtgtta 

ccgcgtcagt 

cctcagctcc 

tccctgaagc 

ccttccagag 

tgcaccagcg 

agttttcttc 

gggctgagaa 

cagtggtccc 

taccatgttc 

caggggtcat 

aagcctagga 

tagccaaaca 

tcctgtcttt 

tggccacctc 

tccaatctgg 

ggggctccct 

tgtgttgtgt 

aggtgcttct 

tgctgacttt 

cctctgtcca 

gagctgtaga 

agccctcaat 

cccctttcca 

atgaaggcca 

aaggagtggt 

agaaaagggc 

agctctctag 

ccccagcagg 

cagtgagagc 

cagtggcaag 

gcccccagca 

aggtgagccc 

gagaagtagt 

aggcaccctt 

ggggaatggg 

gtcagtccgg 

tcacactctg 

ttcttttggt 

aagtgccctg 

tgtgaccttg 

atgcttgaac 

ccaaacctca 

ctgcctggtt 

actcgggagg 

aacgcggagg 

cacccagcct 

gcaccgacgc 

aaagcgagga 

ctctggagga 


gaggatagtt 

tggggagggg 

gggaagctca 

tgggaaaagc 

aaagaccaga 

atggaggtag 

aatggaaaga 

accggcgaag 

agaggaggag 

gatgagacaa 

gcctgcctgc 

gagctcaggg 

gacagcctcc 

cggacgttgg 

cattctggca 

gcctccccag 

cacctcctca 

agcccgttgg 

cgagaggcct 

actctgccct 

tcctcaagtt 

ctttgtttct 

tcagtgaaca 

tggcctctct 

ctcttcttga 

ggtgaaagtt 

gactcagcat 

acctcaaggg 

ttaaagaaag 

gttccacctt 

tccttcatct 

tcaatgtata 

tggtttcctc 

agaggagggg 

tgaagtagtg 

gacctggtca 

ggtgttacct 

gctctgatac 

ggagtggagc 

ggtgctgagg 

acacactggg 

tcccccaacc 

cctggggcag 

cgctgcgtga 

ggccagaaat 

ccgggtGgcrt 

tggttggggg 

ctaatgtggg 

gccagctcct 

ccttgcctct 

aggctggggg 

gtgagcttgt 

gtcaagtcct 

gctgggcatc 

ggtaagtcgc 

cagaggaatc 

accctcatga 

cagatgaggg 

agaaagtaat 

tggtgtgaac 

acacacgcac 

agcaccgacg 

gcagcacact 

ggaacaccag 


ctactgattg 

tcagggggag 

aaggaggagg 

gtggagggaa 

tcaggggagg 

cgggaaagag 

tcagaaacca 

agggttggag 

gaagagaaag 

agaggcttct 

cacatttctt 

ccctgggggt 

gagggtgggc 

attctgcagc 

gagtgagaag 

gcccctgcgt 

gcccctgccc 

gtgtggagcc 

ctgctgcctt 

gtgggtgggt 

ccttcttcta 

gagccttggt 

gatacttttg 

tggctccctc 

ctcccacatg 

gcagagagag 

tgtggetggc: 

cagtgtctca 

atcagagtgt 

gtactagctg 

gcaaattggg 

aagaagttaa 

ctaccctgct 

gtgagcagag 

gagggatgaa 

ggaggcctgg 

atgatgtgag 

tctgtgcatc 

cgtggtcctc 

aggagtgaga 

cagggcaggg 

ccccagtccc 

tgaagcccag 

attatggatg 

ggcaacagag 

cttcagagag 

agacatggga 

caggtgggga 

gcagagttta 

gagattgcac 

aaagctctgt 

ggataagggg 

tagggaagtg 

aagagacttt 

ttgctctctc 

gaggggacct 

aacccactca 

gtaagagaca 

cgtgaaacgc 

gaatggaaca 

ggggcccggc 

cagcgccagg 

gggagtgtgg 

ggcagctggg 


agtgacagat 

agggatagga 

gagatggagc 

gtgggaccca 

gatgggaaga 

aatcaggact 

gagaaggatg 

acagggaacc 

gctgagagag 

ggtaaccact 

ctctcaggga 

ggctgtgagg 

aaacaggctg 

cgctgccgcc 

ctgtttgcag 

cactggcatc 

cctcagcatc 

cttgcttctc 

tccagggagc 

agcctggatc 

caggggcctc 

actccaaggt 

gctctggaca 

ccctggtcag 

taatcactac 

ctggagtccc 

ccagccctag 

cacattcagg 

ctctctgaca 

catgacctga 

ggccacaata 

cctgtacaaa 

gtactctccc 

agggtggggg 

atgaaggctt 

ggtgctcaga 

gggtcggctc 

ttgccccagt 

gggagagaca 

acagggggtg 

actgagggcg 

accacctctg 

agccccctcc 

agctccttgg 

tcactgttat 

cctgcaggga 

acaagatgga 

gcaggagagt 

ccctcaaggc 

atccttccct 

aatcctccag 

taggattaag 

gagatcagag 

tctggccctt 

tgagctccag 

ttgcggcttt 

gggtccccac 

ttgctcctcc 

cgcacggggg 

gcagccgctg 

gctcacacac 

aggggccggg 

atcttccacc 

atgccagcgc 
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124080 
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124920 

124980 
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125220 

125280 

125340 

125400 

125460 

125520 

125580 

125640 

125700 

125760 

125820 

125880 

125940 
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126060 

126120 

126180 

126240 

126300 

126360 

126420 

126480 

126540 

126600 

126660 

126720 

126780 
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127380 

127440 

127500 

127560 
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cacactcggg 
gagcacctgg 
gcaacctcag 
gccttggaga 
cccaaggagc 
agcctaggac 
cctggctttg 
aggagtcacc 
tatcagctat 
tatgagtaca 
catcagtatc 
aaaaaaaaaa 
gccaaggtgg 
gaccccatgc 
tgtcacccag 
gttcacacca 
atgcccggct 
aggatggtct 
attacaggcg 
ggcacgggcc 
agtgatccaa 
caagaccctg 
agcctggggt 
aggactggct 
ggctctgact 
tggcaagacc 
ccaagttact 
atgtctggag 
tagaagacag 
ccggcccaac 
tctagacaag 
ccaatttgca 
tcagaaagtg 
catgggagca 
ggtcttgttg 
aaactcctgg 
acatgccacc 
tgttgcccag 
aagtgctggg 
ttggggggczt 
tcccctcccc 
ggcgg ccccc 
ccaggccaac 
cctgcagaac 
cctgtggctc 
agccctggag 
cttccagggc 
gcccggcaac 
cctgctccac 
ctctctgtgg 
acccttcctg 
ctccaggggc 
tctctctctc 
gtctatccct 
tgcctcttct 
cgtgtgtgta 
cccccagggt 
gcccacatca 
aagttgcgct 
gattcccatt 
tgttatgcag 
attgttatta 
aaatcatccc 
aaaatcttcc 


gcctgtcagt 

ctcacgccac 

gagcgtgaca 

ccccacactc 

cccaggatgt 

gttgcttgaa 

ggggtgcagg 

tggaaatcat 

agtgacggca 

gacacctgca 

ataataagga 

gctttggctg 

gaggattgtt 

ctacaatttt 

gctggagtgc 

ttatcgtgtc 

aaattttttt 

cgatctcctg 

tgagccaccg 

tgtggtccca 

gctgcaataa 

tctccaaaag 

gtaaaatgca 

ggtggcctca 

ctgcaagggc 

agaccccaga 

tccagtcatt 

acattttggt 

agatgctgct 

tgtcaatact 

gtcaaacatg 

gtgctttcta 

gaagtggcca 

ccagggctct 

tgttgcccag 

gttcaagcga 

atgcctggct 

gatagtctca 

attacaggtg 

ggcccccagc 

acagctcccg 

agctgcccca 

aacttctcct 

aacctcatcc 

ttctccaaca 

gagctggacc 

ctggagcggc 

atcttccgag 

ctacaggtga 

gcccctctgc 

cctcagcatc 

tttacttttt 

tctccctcta 

ttccatccat 

gtctgtctcc 

tctgtctctt 

cccctgccca 

cggtgaagta 

tctctccaat 

ttccaaacct 

ctggagaatg 

ttattgatct 

actccatttc 

cttggctaca 


cccatgcgtg 

tacccaaaat 

caacacacac 

aaaatcacca 

cagagtgcag 

gcagaaggtg 

ctcccttttt 

aagaaagtgt 

aaggccaggg 

ctccactctc 

tggatcatat 

gatgcggtag 

tgaggccagg 

tttttttttt 

agtggtgcga 

tcagcctccc 

tttgtatttt 

acctcgtgat 

cgcccggcca 

gctcctcagg 

gctgtgatcg 

aaaaaaagaa 

gattcccagg 

cttagagacc 

gaaaagtaca 

gtgagtccat 

ctccaatggg 

tgtcacaact 

aaatgcctta 

gaggcagaga 

caatagtgaa 

caatggcctt 

ggccacttga 

aggcctttat 

gctggagtgc 

tcctcctgcc 

aatttttttt 

aactcctggc 

tgagccacca 

tggcactcct 

cctcggcctg 

tgctctgcac 

ctgtgccgct 

gcacgctgcg 

acctctccac 

tcggtgacaa 

tgcagtcgct 

gcctggtcag 

gcctgccctg 

tccccgaccc 

tccatttctc 

cccttctgcc 

actccacaac 

cactgcctct 

cttcacacac 

tctgtgatct 

aaggcctttg 

gagagagaag 

tcactgggca 

gtcatctcaa 

gatgctctga 

aattattgtt 

cccaggaagc 

gagcctccgg 


cacacctggc 

ca'cagataca 

aaaaccacca 

acccctcagt 

aaacaagtct 

ttcagtcact 

ccccaggcaa 

agaggtcaag 

atgatgggag 

tagcccccag 

ccaaccttca 

cttatgcctg 

agtttgagac 

tttttttttt 

tctcggctca 

aagtagctgg 

tagtagagac 

ccacccgcct 

aaaattttta 

aagctgaggc 

taccactgca 

agaagttttt 

ctgtcccccc 

cgacccttaa 

ggaaagtaag 

ttcacacggg 

gtgactttgc- 

ggaggcaggg 

tatagggctg 

aaccctgacg 

aacaggaatg 

ttggcattat 

ggctataacg 

ttttattttc 

aagggtgtga 

tcagcctccc 

tttttcttgt 

ctcaagcaat 

tattcagccg 

gccctggaag 

cctcctgctg 

ctgctactca 

gtccctgcca 

gccaggcacc 

catctacccg 

ccggcacctg 

gcatttgtac 

cctgcagtac 

cccccaccct 

tggcgtgcgt 

tctgtctatg 

tctctacctg 

cttcacctct 

ctcactaact 

ccactccgca 

cacgtgtttg 

cagctgtttt 

gcagagccac 

atgggacggg 

tgcaggggaa 

aaatggaagg 

tattgttgtt 

aataacacac 

ctggaagggg 


tgagcagcac 

tacacacatt 

ctaagcaagt 

ctctcccagg 

tcctcccctc 

gtgtgcccag 

aactgccaga 

ctagttccgg 

gccctgcacc 

gctctctggg 

aaagttactt 

aaatcccaac 

cagactgagc 

tttgatacag 

ctgcaagctc 

gactacaggc 

ggggtttcac 

cagactccca 

aaaaattagc 

aggaggattg 

ctccagcctg 

aagtaactgc 

aggaattctg 

ggcccctccc 

ggcactgggc 

cctcagatct 

cccccagggg 

tgctgctggc 

cccccacaac 

ttagtctttt 

aagagatgat 

tttttaatat 

ttgtcccctg 

tattttttcc 

tcgttgctca 

aagtagttgg 

aaagacaggg 

cctcctgcct 

ggtctaggcc 

cccacctagt 

atgctcctgg 

tccccgccca 

cccagcactc 

tttgggtcca 

ggcactttcc 

cgctcgctgg 

cgctgccagc 

ctctacctcc 

cagccccttt 

ccctcctctc 

tctcttttct 

tttaggtccc 

ctgcctctgc 

tgcctccccc 

tacaccccca 

ccttcagggc 

tctcacccac 

agccactggc 

agaagcccac 

gaaagaaaag 

aataccagta 

atgctgactg 

cctccaaacc 

gtgaaaatat 


tgcatttggt 

cacgcacacg 

gcaatttgca 

gtctctgaac 

tgccttcaaa 

ggaatgactg 

agaaaatccc 

cctagaactt 

cctattaaaa 

cctgcttttc 

tgggggaaaa 

actttgggag 

aacatagcaa 

agtctcgctg 

cgcctcccgg 

gcccgccacc 

cgtgttagcc 

aagtgctggg 

tgggtgcagt 

cttgagccca 

ggcgatggag 

gaatgaggag 

catagttcct 

ggcacaaaga 

accagtgggc 

ccaaagggtc 

acatttggca 

atctagtggg 

gaggaactat 

gacattaatc 

cattcttcaa 

atgagaagcc 

agcccccaga 

cctgaaacag 

ctacagcctc 

gactacaggc 

atctccctta 

tggcctccca 

ttttaccaag 

aagttctgct 

ccctgcccct 

ccgtgagctg 

agcgactctt 

acctgctcac 

gccacttgca 

agcccgacac 

tcagcagcct 

aggagaacag 

ctggtttcct 

tccccaggcc 

ctcttacatt 

ttgctgttcc 

ctgtctgtct 

atctgtcttc 

tgtctgtctg 

actctgcctt 

cctcaagtct 

atcccacaga 

accccttcta 

ggtaaatctc 

attgttattc 

tttgacacgc 

accctgagag 

ccaaattctg 


127680 

127740 

127800 

127860 

127920 

127980 

128040 

128100 

128160 

128220 

128280 

128340 
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128640 

128700 
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129300 
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129600 
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129780 

129840 

129900 

129960 
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130140 

130200 
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130500 

130560 

130620 

130680 

130740 

130800 
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131040 

131100 

131160 

131220 

131280 

131340 

131400 

131460 
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ccctctccct 

ctagttatca 

caatagccat 

tacagtagtg 

gagggaaaga 

ttgcctaaat 

gccacagtgc 

ttttttttac 

catctcctag 

gtgcttggca 

aacgagcctt 

atgtgggtca 

acagtttatg 

ttttcacctc 

gatatagtca 

gtcttctcac 

tagggacatg 

tcatggtctg 

aagcacacag 

tttggacaaa 

ggatagtact 

tggagtgcag 

ttcctgcctc 

tttttgtatt 

gacctcaggt 

ccatgcccag 

atttagcaca 

actactacta 

atgctttcaa 

gggcctgagc 

ctaaccctct 

ttccaccaag 

gctggtgtgg 

acttgaagtc 

aaatacaaaa 

taaggcagga 

actatactcc 

aaaagcccat 

ctcaggccaa 

caggtggaca 

atggtaccag 

gaaaacacag 

aggtgaggaa 

ccaggcctgc 

gaatggggaa 

gtatctacca 

gtggccctgt 

tgttttcttt 

agtggcacga 

tcagcctccc 

tttttagtag 

ggtgatctgc 

gccgacactt 

ttttgttttg 

cgcaatcttg 

tccccagtag 

agagacgggg 

acccgcctcg 

ctgaatggtt 

ccacacctct 

ggacccatct 

aggagatact 

tagctatcca 

ccaaggtcac 


acttgaacct 

atctgctagt 

aacaacctaa 

aacagaacag 

agtaaaatgc 

agtgacagta 

caagcacttt 

ctcctcagac 

ggattttgaa 

tacggtaagt 

atttaacatt 

acttcaagga 

aagtgcactc 

cttgttaaca 

cctccaatca 

catcatgttg 

gaatgaatgt 

gctacaaggg 

tttggactca 

ttacttaacc 

taataataat 

tagtgcgatc 

agcctcctga 

tttagtagag 

gatccactca 

ccaataataa 

aggattaaaa 

ctactactaa 

aatctcctta 

ttgaatcccc 

ctgaacctca 

aatggctgta 

tggcccatgt 

aggagttcga 

attaggcagg 

gaatcacttg 

agcctgggtg 

agttcagtgc 

caacagcaat 

ttggggtctt 

gcatcatggg 

cctctgcctg 

acatgggctg 

cccaaagggg 

gggccaggca 

catgccaggc 

tctcccagag 

cttttttttt 

tctcggctca 

aagtagctgg 

agacggggtt 

ccaccttggc 

gttttctctt 

tttttgtttt 

gctcactgca 

ctggaactac 

tttcaccatg 

gcctcccaaa 

taaaaatagt 

aatttgctct 

gtgaaatggg 

actgagctca 

ttaacaccag 

ccagctagta 


ggaacgtgct 

tggaaaatca 

caaacatact 

accaaacccc 

acggtatatt 

atagcagtag 

ataggtatcc 

ctcagttttc 

agcattaaat 

gctatataaa 

ggtttcagtg 

ttatactaag 

agccacctca 

atggagaagc 

taaatctttt 

agcctcacaa 

tcctgaggcc 

gacagtactc 

gcaagaccta 

tctctcagtc 

cttttttttt 

tcggctcact 

gtaactaaga 

aagggtttca 

cctcggcctc 

tccttattta 

tgtaaggcat 

tactgagatc 

tcaatatata 

gatctactat 

cttactttat 

aataggaaac 

ctgcaatccc 

gaccagcctg 

cggggtggca 

aacctgggag 

acagagcaag 

tgaagaaatc 

caggacctga 

ctgaaatggg 

aaggaagcaa 

tgaactgcct 

gggtgagatc 

agctttggag 

ctgtgccagg 

aatgttccag 

catattccct 

tttgagacga 

ctgcaacctc 

gattacaggc 

tcaccatgtc 

ctcccaaagt 

tcagtcatta 

tgagacagtc 

gcctcacctc 

agacatgtgc 

ttgcccaggc 

gtgctgggat 

ctttatgctc 

atggctttgt 

gataacctgt 

cagcccaatg 

ggaggacacc 

aggaacatga 


tcctctgcct 

ggtcagtgct 

gagcacccac 

ctgccttcac 

ggaaaaatat 

ccgccaccac 

actctgccat 

tcatctgtac 

gcatgaataa 

tgcttgttaa 

aagtggccca 

gtcatgagtg 

tctcatttct 

tgaggctggg 

caaccattgt 

caacctggtg 

acacacccag 

tggagtacaa 

ggttcaaatc 

tccatttcct 

tgagacgacg 

gcaacctctg 

ttacaggcat 

ccatgttggc 

ccaaagtgct 

agaagttttg 

ttagcacata 

aaatactact 

ttagttattt 

tttctgactt 

ctgcaaactg 

gagttagtgt 

agcacttcgg 

gccaatatgg 

ggtgtctgta 

gtggaggttg 

actctgtctc 

atgttattat 

ggtcagcaaa 

aagtgtttgt 

cttcacacct 

ggtagggctg 

cgcagggtgc 

gaaactccac 

tgagttcatt 

gtgccaggga 

actcaagtgt 

agtctcgctc 

tgcctcccag 

atgtgctacc 

ggccaggctg 

gttggattac 

cagtggcctg 

tcactatgtc 

ctggggtcaa 

caccatgtcc 

tggtctcaaa 

tacaggcatg 

aagcagatca 

gcaagttatt 

accttggcga 

tctggtacaa 

aactgaagct 

cctagaattg 


catccagggc 

gatgatgcta 

tacgagctag 

agagatacca 

gtcttatatt 

ttagtgggta 

ttacaagcgt 

aatggggtag 

tttgtaaagc 

aatactattt 

acttggactc 

agtcccagaa 

acagcccagt 

ggccctgaag 

cggtgtgacc 

atagggacag 

gaagagctgg 

ttgagcaggc 

ctggctccta 

catctctaaa 

tcccactcta 

cctcccaggc 

gtgtcactac 

caggctggtc 

gggattacag 

taaggattaa 

tgggcactat 

acaaattgat 

aggaggaatt 

atttaacttt 

ggaataatga 

atagaaagcc 

gaggccaagg 

tgaaaccctg 

atcccagcca 

cagtgagctg 

aaaaaaggaa 

gaccccatcc 

ggcttgggca 

tctctacgcc 

ggccttttat 

ggctgggaga 

aggtgtgacc 

cagaggacca 

catcaacaga 

ttcaggagag 

agccagatga 

tcttgctcag 

gttcaagcga 

atgcctggct 

gtcttgaact 

aggtgtgagc 

catggttttt 

acccagctgg 

acaattcccc 

agctaatttt 

ctcctgaact 

agccaccgta 

gatctcagtt 

taaccactct 

gcaggggttg 

agtgagtatc 

cagcaaaata 

gcccaggtct 


tagtgcctaa 

atgataataa 

atgctaagaa 

ttcccatgag 

attcttattg 

cacagggtca 

gtgacatttt 

caagagcacc 

acttagaata 

taaaaaaaga 

catcctgaag 

attgcacctc 

tgggagatta 

accctataga 

ggaggcttat 

ttaggggcac 

cgcttgaacc 

tcatttttga 

tatatatgac 

atggcaatca 

tcgcccaggc 

tcaagtgatt 

acccagctat 

ttgaactcct 

gtgtgagcca 

aatgtaaggc 

aataataatt 

catgcattta 

tggagtcaga 

aagcaggttg 

aaataatacc 

catagttcag 

tgggtggatc 

tctctactaa 

ctagggaggc 

agatcgtgct 

aaaaaaaaaa 

tccattgact 

gaggggacct 

cctggcatga 

agaggagatg 

tgccacaggc 

caagatggag 

cagcttttca 

tatttactga 

aacagaaaca 

taaagacact 

gctggagtgc 

ttctcctgcc 

aatttttgta 

cctgaccaca 

caccgcaccc 

gtttgttttg 

agtgcagtgg 

catcttagcc 

tctattttat 

taagcaatcc 

cacagctggc 

tgaattccag 

gagcctcgat 

tgaggattaa 

caatgaatgg 

aaagcacagt 

gtctgactcc 


131520 

131580 

131640 

131700 

131760 

131820 

131880 

131940 

132000 

132060 

132120 

132180 

132240 

132300 

132360 

132420 

132480 

132540 

132600 

132660 

132720 

132780 

132840 

132900 

132960 

133020 

133080 

133140 

133200 

133260 

133320 

133380 

133440 

133500 

133560 

133620 

133680 

133740 

133800 

133860 

133920 

133980 

134040 

134100 

134160 

134220 

134280 

134340 

134400 

134460 

134520 

134580 

134640 

134700 

134760 

134820 

134880 

134940 

135000 

135060 

135120 

135180 

135240 

135300 
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agagtgcagt 

ttggacccta 

agataataat 

gtagcacaga 

gccgatgatg 

cacagagtag 

aattcataaa 

tcacagatga 

aagcagctca 

aagaacagga 

cagagagctg 

ggatgacttt 

aggggcgggg 

ttttaacagt 

ggatttggca 

gaagggaggg 

agaaattctg 

ctctggaact 

tgctcactct 

tgtcctactc 

agggattcca 

tgagtcaggc 

gggtggtggc 

ggacggatct 

agggcggaag 

tgggtgtagg 

gtggcaggca 

gggaacaggc 

agacctcctg 

cagtgaggtc 

ccccagtgct 

aagtcagctg 

ccactggccg 

caaggtcctt 

gacagcgcct 

cctctggggg 

ttcccccagg 

tattgtcata 

cttaacagca 

gaatattgta 

acagctaggg 

cttcataatt 

tctttatgtg 

ccccatttta 

taatggctaa 

ctcatggaac 

ttcatttaat 

tgcccacggt 

ccagtctcca 

gccacactgg 

ctcctgaatt 

aatacacccg 

aggttagagg 

cttctgccac 

tgcctcaggc 

gcggacctgg 

gagcacgtgt 

cagggcgtgc 

aacaacagcc 

ctgcggctca 

ttccagcgcg 

ggccgagacc 

acgcggccgg 

gaggccgggg 


tgttcagagg 

ggcgagtcac 

agtatccacc 

gcttggcaca 

acgatgatac 

acactcggta 

aatcagatgc 

aaagacccag 

tttgattagt 

tgctgtcagg 

tgaaccggcc 

agcagaggct 

ggagggctag 

ggagagagga 

acagcttgga 

gcaggacttt 

tgggccaaaa 

ttggtctact 

cccccaaccc 

ttgctgacac 

agcctacccc 

tgaggccagg 

agcagggggc 

caatgtggcc 

ccagatggga 

ctgggagtca 

tggggaggaa 

cccctgggtg 

gaacctgctg 

agggagctga 

ctcctctaaa 

atcatcagaa 

tgggccacac 

tctgcactaa 

ctaaggggag 

tggagctgat 

gagctcctct 

gtagccacca 

tttatcatcc 

catactgagg 

aaaaggcaag 

ggaccaaatc 

aaccatcctc 

cagatgagga 

caaacaaaaa 

tcatagaact 

ctctacatta 

ggcatcagtt 

tacttcacca 

cttccctggg, 

agaaactctg 

aaagtttagg 

aggcagaatc 

ggtgcacccc 

ccattctctt 

ccaacctgag 

ttcgcggcct 

accgcgcggc 

tggcctcgct 

acgctaaccc 

cgcgcgtgtc 

tgcgcgcgct 

gcagccgcgc 

cgcccccagc 


tctctggagt 

ttcacttctc 

tcatagggtt 

tggtaagagc 

ccatcctaga 

cagctctgtg 

agatgggatc 

gcccagagag 

gtcagagcca 

gaagcaggca 

agtggggagg 

ctcagcccag 

ggctgtgaaa 

aggaccggat 

tggggggagg 

ttgcagagaa 

cctggggctg 

cttctgacct 

ccggcccctt 

ttcataaaaa 

ctggatgggc 

gaggtgcaag 

ggagaatggt 

aagaggaggg 

gggggcgaga 

atgccctccc 

agcgtctccc 

aactggcctg 

accacagtga 

agtcctgctt 

aaagtgagct 

tcccctggtg 

acacccgccg 

tgtggcc:tga 

tagtaatgca 

gcccctcacc 

tttaaagaaa 

tttattgatg 

aaccctcctt 

aacctgagac 

ctggattttg 

cctgtgtgct 

tggaatcctc 

aacaggcaca 

caaaaacaaa 

ccacaaggaa 

tacagatgag 

aatgacagat 

aaccagaagt 

gaacctgtag 

ccccccgccc 

aaccactgac 

caaatccagg 

cttccctccc 

cttctgtgcc 

ccacctcttc 

gggcagcctg 

cttccgcggc 

gcccggcgag 

ctgggcgtgc 

cagctccgac 

ccgcgaggcc 

ccgcggcaac 

cgatccctcc 


tggaagccac 

tgaggctcca 

gtgacaatta 

tcaatcagtt 

ctgatgagct 

gaatgaatga 

ttaaagatca 

gtgcttggag 

agagctggga 

gggatgctgt 

caagggaaat 

agggagggga 

gtcaagagct 

ttgaaagcta 

aggcaatgga 

acaaaaggag 

tgggtcaaag 

cccgaggtcc 

atcgatcctc 

gaggaacccc 

ctggaagaga 

gagccagctg 

ggtgtcagag 

ctcttggcac 

acaggcagga 

ccaacctgag 

caggcagtga- 

agcagagtgg 

tgccctgcac 

ccctctctgg 

gggctgatgg 

agctggttat 

ccccccgctg 

ttaggtgact 

atgtggcttc 

ccaataccca 

agggacagga 

gttgactatg 

tagcctgctg 

tccatgaggt 

aactagggct 

gggcacgtgt 

agaacaaacc 

gagagatgac 

aattaaaaaa 

aggtgttcta 

gaaactgagt 

ccaagatttg 

tctgaaactc 

acatggggat 

caccccgctc 

ctcaccaata 

atgctatgaa 

ccggccaagg 

ccactccacc 

ctccacggga 

gaccggctgc 

ctcagccgcc 

gcgctcgccg 

gactgccgcg 

gtgacctgcg 

gacttccagg 

agctcctcca 

accctctacc 


gttccactgc 

tctcgtaatc 

agttactata 

acctgcttga 

ctgtaagcgg 

ggcacatccc 

cctatcctaa 

ctgcgcaagg 

gtttggaggg 

gttaagattc 

gtggtttttg 

gatagggagg 

tattaatgca 

cattcaagga 

ccccaaggca 

aggagaggag 

gcacctgaat 

cccaaaatgt 

tgaccataca 

atttaggtgt 

acaagagcac 

gaggcctgag 

gcagccgaga 

gctcagttcc 

gcacaggaag 

gcctccgacc 

gggagggaga 

atgctcctgt 

aagaggggag 

caagccctta 

gtgccaaggc 

aatgcagagt 

ttaattctga 

ccctagcacc 

cttcctctcc 

gcctagtagc 

cccaattgtt 

cacctgccag 

agggggttat 

taaaacttgc 

ctaagtgctg 

ccagcacttc 

caggaagtag 

tggcttggcc 

aaaaaaagaa 

agcaccttca 

cacagatatc 

aaatcagaaa 

aaactgtggt 

tcccaggctc 

agagatccgc 

ccactttttc 

tcaaaaggtc 

ccccagcggg 

ccacccagga 

accgcctgcg 

tgctgcacgg 

tcaccatcct 

acctgccctc 

cgcggccgct 

ccaccccccc 

cgtgtccgcc 

accacctgta 

gagatctgcc 


atattagctg 

tctgaaatgg 

taggatctgt 

caatgctgac 

gggtgcctgg 

agaactcacc 

gtcccttgtt 

tcacacagcc 

aggcaaggtt 

caaatggatg 

aaatggaaga 

ggagataggg 

tagagaacgg 

agtggcaacg 

gaggctcaga 

gttagaatca 

tccctaggat 

ggattacccc 

tctctgggtg 

tttgagtggc 

caggccatgg 

ccaggatttg 

aggttgaggg 

tgtagcgaag 

gtggaggctg 

aggctcctgg 

gccacagtca 

tctgagaccc 

gacctcaagg 

tctctttgag 

attagctccc 

ccaggaatcc 

accatagttc 

aggcaggtgg 

tcccctgccg 

agtactttgg 

actgagcccc 

atactgtacc 

acataataag 

ctaaaataac 

agcctgtggg 

cctcatatga 

gtatactcat 

aagttaagaa 

taatggctaa 

tacatgctgc 

ctgagtgact 

ggctggctcc 

cctgccaatg 

caccccaaac 

aggggatcct 

cacagcaaat 

aaccctttct 

gtctgcaccc 

tgacttgttc 

gctgctcaca 

gaaccggctg 

ctacctgttc 

gctcgagttc 

ctgggcctgg 

ggagcgccag 

cgcggcaccc 

cgggg-tggcc 

tgccgaagac 


135360 

135420 

135480 

135540 

135600 

135660 

135720 

135780 

135840 

135900 

135960 

136020 

136080 

136140 

136200 
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136320 

136380 

136440 

136500 

136560 

136620 

136680 

136740 

136800 

136860 

136920 

136980 

137040 

137100 

137160 

131220 

137280 

137340 

137400 

137460 

137520 

137580 

137640 

137700 

137760 

137820 

137880 

137940 

138000 

138060 

138120 

138180 

138240 

138300 

138360 

138420 

138480 

138540 

138600 

138660 

138720 

138780 

138840 

138900 

138960 

139020 

139080 

139140 
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tcgcgggggc 

ggtgaggacc 

tcccgaggcc 

gtgccccacc 

tcccgtccac 

cctccctttc 

ccccc tgggc 

gccctcccca 

ggcagccgct 

cagagcgtct 

ctttggggga 

tcaacctcct 

gcacctggag 

tgtttgggaa 

acttgggacc 

ttggagaagg 

cgctcccccg 

gccaagagtg 

caaccctgag 

acagccctct 

gtcacataca 

ggcactgctc 

accaaggctg 

agcgtacaca 

aggtagaagc 

agggaccgag 

gcttgaggta 

gaagagctca 

gcacttggag 

gtggtcgctt 

ggccaagaat 

ttttaagaat 

gggaggctga 

ggcagaaacc 

ctatagtccc 

gactctgtct 

gcctagatgc 

aaatgatctt 

tagactccgg 

ccagggagct 

taatgggttg 

gcctgggcag 

ttatcattgt 

aagaggagtg 

acccctccca 

cccgagctct 

ccacccacgg 

gcacacccac 

ccccaaactc 

ggatgttcaa 

tagaaaggaa 

cctcccacat 

ttgtaattaa 

cccagccaca 

cattacctca 

atggaatctc 

aatttgcata 

taaaggtaaa 

ttctagcctt 

catactggcc 

aaaatttctc 

cagagccctt 

gcagcagaca 

accttgggaa 


gccagggcgg 

agcgagggga 

ctgcgctctc 

acctctgact 

ccggggctgc 

ccctcccagc 

tgtcctgact 

gttctggcca 

gacccgcact 

tccaccagca 

tgcctcagtc 

gacactggag 

agaactttcc 

aaaaaattta 

ccgaaagcgg 

ggagccgtcc 

ccatgaggag 

ggtgctggtg 

tggtctggcg 

tccagaccct 

cactcctggt 

cctcagggaa 

ggtttccgcg 

cctgcacaca 

catcagggat 

gaactcgggt 

caatcagaac 

ggctaagaag 

agaagtttgg 

gcacctgtaa 

ttgagactag 

ctagtttgga 

ggtggacaga 

ccgtctctac 

aggtactcag 

aaaaaaaaaa 

tagggcttga 

gagaagaaga 

ttaagggtgt 

cggagtctgg 

gattggaggg 

ggggagggga 

gaaatattca 

tgtgagcctc 

ccaccagccc 

cacacacact 

gtcccacaaa 

tcaatcagga 

tccaaacaac 

ggccatccag 

gggatttgtc 

tctgagtgta 

ctcctccctt 

gtctatccct 

tgcctggaat 

tccctccctc 

tgttatgatg 

acaccttagc 

ttctctcaca 

accagttatc 

agtctctcct 

tcccaaggcc 

gaggctcagg 

gcacagccag 


ggacgcgcct 

gcagatgtgc 

ggccgggctc 

gcggtgctga 

ggctccggcc 

tcctctcctc 

tgtggcagcc 

ttaactcttc 

cctaagggcc 

gagcctttgg 

agggccaggc 

gaatactttt 

tctccactgt 

ttaaaaaatt 

gatgaggtct 

cccgccatca 

gagctgaagc 

gcacctcttc 

cctgaggaac 

gttccttcag 

cacagacagc 

acacagcccc 

tacacttgcc 

gcaaccctgt 

gaaccaggtc 

gatcagaggg 

cttcatcttt 

gacttgggtt 

tagtttgaat 

tcccagctaa 

cctggacaac 

gtggggtgtg 

tcacttgagg 

taaaaattca 

gaggctgagg 

aaaatctagc 

ggaagcaggc 

gaagaaagga 

ggggtgggct 

ggtcttgcag 

acagccagac 

gctgctggta 

tgggccatcg 

aggaagaagc 

agacagaccc 

cgcaccaagc 

caggcaggtg 

cccagcagag 

ccagattcac 

cccaaccccc 

tgagattcct 

cccccattct 

tctctaattt 

tccttcccat 

ataaaagaaa 

ccacccagcc 

tcactcagga 

agagcatcaa 

atctcatccc 

atgtatgtga 

gaagtaggaa 

aagactgcac 

ctctggctcc 

gtttcagtcg 


actgaggacg 

cccggcgctg 

cccagccctc 

gatcgaagag 

ccagtcgccc 

cccggggagc 

ccaagagggc 

cccatcccaa 

cacagcggac 

aagctccccc 

tgaccctgac 

ctcctaagtc 

ggcccctgcg 

ctattatttt 

cagaatgtaa 

aagagcttcc 

cctgcattct 

tcttcatttg 

taagcctggg 

gaaacaagag 

cccaacatgg 

atgccagagc 

tccttggcta 

gggtatgtgg 

agagaagcag 

acaggaatcc 

ctggtgtgtg- 

gggatttagg 

ttggagctaa’ 

ttgggaggct 

atatcgagac 

atgtctcaac 

tcaggagttc 

aaaaaattag 

cacaagaatc 

ttgggaggtg 

tgaggttctg 

gaagaaatca 

gggggagagt 

gggcttgtgt 

aagagctctg 

aattagaggc 

gtccagatgc 

ctgaagcaaa 

acggacgccc 

agagccatgt 

tcccctaaat 

atcacctcca 

caccttgacc 

accaatgctc 

ctatcccctg 

gcattttcgt 

cctgcatcta 

tctccccacc 

actgcgataa 

ataccgtggt 

gcctgtaatg 

agatccctca 

aaaccttcac 

aacccaccaa 

acctcttccc 

ctctctgacc 

ctccaagcag 

tctagaacag 


actactgggg 

cctgccaggc 

tgctttgcct 

gccagtgtcc 

caccttccct 

aggccgcctc 

gtgtgtggtg 

ggctggggtg 

accagagggg 

agggag c ccc 

ccctgcttac 

taccctggac 

tggtgaagat 

atctactgta 

ggattgcagg 

tggtggctgg 

aggtgaggcg 

tccaggggaa 

gaagacctgc 

cagttctcct 

ctttgggtaa 

aaacacctta 

agtgcccttg 

tctctctctc 

gtttccaaac 

caaattggga 

gaagagaggc 

ggtgagtctc 

gaatctagct 

gacgtgggag 

tgagtctctt 

gtctgtaatc 

aagaccagcc 

ccaggcgtga 

actccagcct 

ggaatagaaa 

tgattctggc 

gcatctaagc 

gagagcagct 

caggctggct 

gtggaggagg 

aggctgtcca 

tatttcagaa 

gccactctcc 

atcacgtgca 

agcacgtgca 

tctgacatgc 

gcgatctcac 

cacacaccct 

tgatggggaa 

aaaaaagcaa 

ctgccagaca 

ttcagttacc 

acctccctgc 

cctcctcgct 

gaccagattc 

gcttcctaat 

gagtctggta 

tccagctaga 

ccgactttga 

cctcctcaga 

atatacaggg 

acggctgccc 

agaatgagca 


gggctacggg 

gcccccggac 

cctgctcctg 

gatccccgct 

ggccttgctg 

tccttgcctg 

gctcagccct 

gggcccccca 

cttttgtctg 

acccaggacc 

cctagtcccc 

actttttagg 

caaaagaagt 

agatttgttg 

gccaggaggg 

aggtggtgtg 

cagtgtggca 

gagctgcagc 

tgtctggtta 

gcaaggagga 

atgtgaacaa 

gcaaacagag 

tgcagtgcac 

agctcctgtg 

aggctagaag 

tgcattactg 

tggggactgg 

atcagactga 

tgggcagggt 

gatcacttga 

aaaaatgttt 

ccagcactct 

tggccaacat 

cggcgggtgc 

gggtgacaga 

gatagagggg 

tagggaggtc 

ctgaggcagg 

ggtcagaaac 

gtgaggaggt 

ggctgctggg 

ggtcatagaa 

cagtgaaagc 

accaaccccc 

cacccacact 

agcacaccaa 

acactgacat 

atgcgcagac 

gagataggag 

actgaggcca 

aattcattca 

cccagcctag 

cagtccccca 

tccagctact 

ggtttcctac 

atctgatcaa 

gcctataggg 

ccaactgctt 

acgtttgtat 

gtgcccccct 

tctcagactc 

gttcttcaaa 

ccgactggcc 

tctaaccgcc 


139200 

139260 

139320 

139380 

139440 

139500 

139560 

139620 

139680 

139740 

139800 

139860 

139920 

139980 

140040 

140100 

140160 

140220 

140280 

140340 

140400 

140460 

140520 

140580 

140640 

140700 

140760 

140820 

140880 

140940 

141000 

141060 

141120 

141180 

141240 

141300 

141360 

141420 

141480 

141540 

141600 

141660 

141720 

141780 

141840 

141900 

141960 

142020 

142080 

142140 

142200 

142260 

142320 

142380 

142440 

142500 

142560 

142620 

142680 

142740 

142800 

142860 

142920 

142980 
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tggggagagg actaggacac cagatgataa ggtttataag cccttaagcc tctaaggttc 
ttacacccag agtagggggg ggacggttct cagccctgtt tccctagctg cgggctccca 
attttcgatc cctaatccga gaggaactcc tctccaatga aatacagact tgggactctc 
aggacactgt ggaagggaaa tttcccaaca gactctgaga gtccaggagg ccagggatag 
accaggtggc aggcccaagg tccagctggg gtcaggtttc tatatgaatt tttaatgctt 
ccagatagac ttgtcagatg ttctgaaaac tgagcatctc ctttcacctc tgtacatgat 
gcccttctcc aaccccattg cccctgcagg agggcaggcc tgggacagat attcagtggc 
ctctggagaa acggttttgg gacagtagaa gggtaaatga cctagttatg ttcccactag 
taagctgtgt gaccttgggc aagttactta acctctctga acattagagt tctgtgggtt 
tgtttttgtt ttgtaagctg gggacaatag tgccagccta aatcaatttg ttgtggggac 
tcagtgcaat agcccatggc aaagtgacct acatgcttgc tgttattatt ctctttcctc 
aagttctgcc tccctcttcc agcttttctt ccaaccccaa agatgtctct ggctattgct 
tcgaaggtag gaactttggt tggttctccc ctttctcttc aggcccaaac tccccacctc 
aagat ccttt ggcctttgta gaaacttcag gtgaggaggt ggcagagaaa taagaaagtg 
tgcaaggctg gtggagtgag agaggaggat agatggcgaa gccctagcag aggggaggga 
agtgggcagt ggagagagg 


<210> 16 
<2 11> 215980 
<212> DNA 
<2 13> Mus sp. 

< 220 > 

<221> modif ied__base 
< 222 > ( 1001 ) . . ( 1100 ) 

<223> a, t, c, g, other or unknown 


<22 0 > 

<221> modif ied__base 
<222> (2123) . . (2222) 

<223> a , t, c, g, other or unknown 


<22 0 > 

<22 1> modified_base 
<222> (3728) . . (3827) 

<223> a, t, c, g, other or unknown 
<22 0 > 

<22 1> modif ied_base 
<222> (5168) . . (5267) 

<223> a, t, o, g, other or unknown 
<22 0 > 

<221> modif ied_base 
<222> (7481) . . (7580) 

<223> a, t, c, g, other or unknown 
<22 0 > 

<221> modif ied_base 
<222> (8849) . . (8948) 

<223> a, t, c, g, other or unknown 
<22 0 > 

<221> modified__base 
<222> (10375) . . (10474) 

<223> a, t, c r g, other or unknown 

<22 0 > 

<22 1> modi f ied_base 
<222> (12270) . . (12369) 

<223> a. r t., c, g, other or unknown 


<22 0 > 


143040 

143100 

143160 

143220 

143280 

143340 

143400 

143460 

143520 

143580 

143640 

143700 

143760 

143820 

143880 

143899 
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<221> modified_base 

<222> (13438) . . (13537) 
<223> a, t, c r g, other or 

< 220 > 

<221> modif ied_base 
<222> (15902) 

<223> a, t, c f g, other or 
< 220 > 

<221> modif ied_base 
<222> (15939) . . (16038) 
<223> a, t, c, g, other or 

< 220 > 

<221> modif ied_base 
<222> (18223) .. (18322) 
<223> a, t , c, g, other or 

< 220 > 

<221> modif ied_base 
<222> (20974) . . (21073) 
<223> a, t , c, g, other or 

< 220 > 

<221> modified__base 
<222> (24403) .. (24502) 
<223> a, t, c, g r other or 

< 220 > 

<221> modif ied_base 
<222> (27574) . . (27673) 
<223> a, t, c, g, other or 

< 220 > 

<221> modif ied_base 
<222> (30892) 

<223> a f t, c 7 g, other or 
<22 0> 

<221> modified_base 
<222> (30901) . . (31000) 
<223> a, t f c , g, other or 

<220> 

<221> modif ied_b as e 
<222> (34443) . . (34542) 
<223> a, t, o, g, other or 

<220> 

<221> modif ied_base 
<222> (38205) . . (38304) 
<223> a, t, c, g, other or 

<220> 

<221> modified_base 
<222> (42373) 

<223> a, t, c , g, other or 
< 220 > 

<221> modified_base 
<222> (42386) 

<223> a, t, c, g, other or 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 


unknown 
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< 220 > 

<22 1> modified_base 
<222> (42393) 

<223> a, t, c f g, other or unknown 
<22 0 > 

<22 1> modif ied_base 
<222> (42461) 

<223> a , t, o, g, other or unknown 
< 220 > 

<22 1> modified_base 
<222> (44809) . . (44908) 

<223> a, t, c, g, other or unknown 

< 220 > 

<221> modif ied_base 
<222> (51380) . . (51479) 

<223> a, t, c, g, other or unknown 

< 220 > 

<221> modified__base 
<222> (56740) 

<223> a, t, c, g r other or unknown 
< 220 > 

<221> modified_base 
<222> (56765) . . (56864) 

<223> a, t, c, g, other or unknown 

< 220 > 

<221> modif ied_base 
<222> (62818) . . (62917) 

<223> a, t r o e g, other or unknown 

< 220 > 

<22 1> modif ied_base 

<222> (68518) 

<223> a, t, o f g, other or unknown 
<22 0 > 

<221> modif ied_base 
<222> (68534) . . (68633) 

<223> a r t, 0 , g, other or unknown 

< 220 > 

<22 1> modified__base 
<222> (74552) . . (74651) 

<223> a, t, 0 , g r other or unknown 

<22 0> 

<22 1> modif ied_base 
<222> (81446) . . (81545) 

<223> a, t , 0 , g, other or unknown 

<22 0> 

<22 1> modif ied_base 
<222> (88519) . . (88618) 

<223> a, t, o, g, other or unknown 

<22 0> 

<221> modif ied__base 
<222> (93791) 

<223> a, t, 0 , g r other or unknown 
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<22 0 > 

<22 1> modif ied__base 
<222> (93794) 

<223> a, t, Of g, other or unknown 
< 220 > 

<221> modified_base 
<222> (96565) 

<223> a, t, c f g, other or unknown 
<22 0 > 

<22 1> modi f ied_base 
<222> (96570) . . (96573) 

<223> a, t r c r g, other or unknown 

< 220 > 

<22 1> modif ied__base 
<222> (96579) 

<223> a, t , c, g, other or unknown 
< 220 > 

<22 1> modified_base 
<222> (96590) 

<223> a, t, Of g, other or unknown 
<22 0 > 

<22 1> modified_base 
<222> (96596) 

<223> a f t r o, g, other or unknown 
<22 0 > 

<221> modif ied_b as e 

<222> (96602) 

<223> a, t, c, g, other or unknown 
<22 0 > 

<22 1> modif ied_base 
<222> (96616) 

<223> a, t r c r g, other or unknown 
<22 0 > 

<22 1> modif ied_base 
<222> (96629) 

<223> a r tf Of g r other or unknown 
<22 0> 

<22 1> modif ied^base 
<222> (96633) 

<223> a, t f Of g r other or unknown 
<22 0 > 

<221> modified_base 
<222> (96668) 

<223> a r t f 0 , g f other or unknown 
<22 0 > 

<221> modified_base 
<222> (96715) . . (96814) 

<223> a, t, Of g r other or unknown 

<22 0 > 

<22 1> modif ied_base 
<222> (104447) . . (104546) 

<223> a f t, Cf g r other or unknown 
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< 220 > 

<221> modifiedjbase 
<222> (114521) 

<223> a, t, c , g, other or unknown 
< 220 > 

<221> modif ied_base 
<222> (114527) . . (114626) 

<223> a, t, c, g, other or unknown 

<22 0 > 

<221> modified__base 
<222> (127063) .. (127162) 

<223> a, t, c , g, other or unknown 

<22 0 > 

<221> modified_base 
<222> (139133) . . (139232) 

<223> a, t, c, g, other or unknown 

< 220 > 

<221> modified_base 
<222> (151051) 

<223> a, t , c, g, other or unknown 
<22 0 > 

<221> modif ied_base 
<222> (153242) .. (153341) 

<223> a, t, c, g, other or unknown 

< 220 > 

<221> modif ied_base 
<222> (164706) 

<223> a, t f c, g r other or unknown 
< 220 > 

<221> modi f i ed_b a s e 
<222> (164708) 

<223> a, t r c, g, other or unknown 
< 220 > 

<221> modified_base 
<222> (164710) . . (164809) 

<223> a, t, c. r g r other or unknown 

<22 0 > 

<22 1> modified_base 
<222> (182242) . . (182341) 

<223> a f t, c r g, other or unknown 

< 220 > 

<221> modified_base 
<222> (192158) 

<223> a, t. Of g, other or unknown 
<220> 

<221> modi f i ed_b a s e 
<222> (192192) 

<223> a f t, g, other or unknown 
<22 0> 

<221> modif ied_b as e 
<222> (198842) . . (198941) 

<223> a, tf c, g, other or unknown 
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< 220 > 

<221> modified_base 
<222> (199437) . . (199438) 

<223> a, t , c, g, other or unknown 

< 220 > 

<221> modi f ied__bas e 
<222> (208276) 

<223> a, t. Of g, other or unknown 
<22 0 > 

<221> modif ied_base 
<222> (215974) 

<223> a r t, o, g, other or unknown 
< 220 > 

<22 1> modif ied__base 
<222> (215976) . . (215977) 

<223> a, t, c, g, other or unknown 

<22 0 > 

<221> modified_base 
<222> (215979) 

<223> a, t, c r g, other or unknown 
<4 00> 16 

ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc taacttcaag 60 

caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga agggttccca 120 

cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct gcctaatgac 180 

tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta atgtgggaag 240 

gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga tgttgattca 300 

agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt tgtgggagtt 360 

cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat ttgcttccat 420 

tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga ttctcaacag 480 

tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat tgacacatcc 540 

cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat agttttagct 600 

ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag tctaacttca 660 

ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt tccttgagtc 720 

t caacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt tccttaaaca 780 

taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta aacacaggaa 840 

gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta ggtccctcct 900 

tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc agctcagctg 960 

ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn nnnnnnnnnn 1020 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 

nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct gtttatagcc 1140 

tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc tttctttttc 1200 

tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg tctagaactc 1260 

acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta caagcatctt 1320 

ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc gggctgaaga 1380 

taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg agtttgatcc 1440 

aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg aatcagggat 1500 

gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat agccaggttt 1560 

cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg tgacccaact 1620 

ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc atgcatgtcc 1680 

atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt tggtaatgca 1740 

cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt cggcagcacc 1800 

aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg aaatgaaaag 1860 

cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac atgagacttt 1920 

gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac ctgaagctga 1980 

cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca cagacacccg 2040 

catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc ccattcatag 2100 

ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 

nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg atggggcact 2280 
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ttgaactccc acctaggcaa aaaacccagt gatctctggg ccagtgtgtt gtttgcaagg 2340 
gaataaggta gagagccgcg gaggaagaca ttgggggttc tatgagtatg tgaaggggtg 2400 
cacacaccac acacacacat tttttttgtt ttaaatttac aaacattaaa ataggctgta 2460 
atgtggctca gtgggtagaa aaacctgctg tctaagcctg gtacgagttc aatccctgac 2520 
aagctggaag gagacaacca accacaactg ctagcagcca gaagcactgc ttgctaacac 2580 
tcaagagagc ctggagtgga agacactgga tccccagcag gcaagcctgc aagaagatgt 2640 
gccttgccta gacaacggca gaacaaacat caaggctggc agagctgtcc aggactgttc 2700 
atattaatca tgtatagata agagggaatg gcacagacag aacaattcaa cacacggggt 2760 
atgaaaggaa aggaacaagg cacacaaagg acaaagaacc tagcatacaa gaaagcctaa 2820 
gcagagagtg gcacttccca gaagggagtc ataaaataga ctgaattcat taaaacaaga 2880 
gccaaagata aacggctcaa aaaactcacg gaaaacaggt caaaataacg tcacccatct 2940 
gacagttgat actgtcaact taaccgtatc tagaactcca gcaggcacat ctccaggcat 3000 
gcccctgaag gggtctttgg actaggttaa ctgacgtggg agtgacacca tctatggacc 3060 
gaagcctcag acagaataaa aaggagccag tgagctgagc gtcagtgctc attgcttctg 3120 
gcttcctgtc tgtggctgca gcgagacacg gtgcttcctg ctttagctgc catgacagac 3180 
cacaccctca aaccgtgaac caaaataacc tcctctctac attgctttta ccaggcattt 3240 
ggtcacacca atgagaaagg ttaactaata cagcactcaa tacttaaaaa cataaacacc 3300 
aaccttgttt gcatgtgtga gactttgaag ctcacgggcc agttatgccc aatgccaggt 3360 
ctgctggcta agggtgagag tgcacaccta taatcccagc tgctgtggaa tcagcaaaag 3420 
cgctacagat ggaaggcagc cagggcagct gagactgact caaactgata gaggtgggag 3480 
gcatagagaa aaccagatta atagagtgtt ccccactatg caagaagccc tgggtttcag 3540 
gacgagagaa ctaagaatac agaagtctac tgtgtagaag cactgctagg tcacacagaa 3600 
acatcactca agtgtctctg gatgctacac ggagggcgtg tgaagtattg cttcctgatg 3660 
atctgtatct actacagcac tgctgtttta gtatgcgctc ctccactaca gctcctcacc 3720 
acaccaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat taatcaaaga 3840 
aaacacacac caccagttag agaaagttaa tcaggccgaa tggcggcttt cccctgtatc 3900 
caggctaccg tcaggacggc tcactgccac tggcaactct gcctgaacaa agcccgcagc 3960 
caacgtgggc ttcaggggct ctaaacactg caatcaaagg ttgtgtgtgg gggtgggggt 4020 
gctgctgcta ttcaaggatt cccaaagctt agatgtattc atcatactca caggaaagcg 4080 
tgttcaaccc atcactcatg agcagtcggt accggggtga cc tattccct gtagaaatgg 4140 
gacggatgtt ctggaaaagt tgacagaaaa gttgattcat taggcaggct ctttgcccaa 4200 
gccctgaggg taagcaaagc taactggcag gagactaggt ttgccattaa tctgagacaa 4260 
gatgaaccac ttgcccatcc tcctgacacc taaatactaa tgaaagaaca atggattgag 4320 
ctggcattat taaaaacgat agaaacagaa gtatcaatag tcatgtgttc tttctcccat 4380 
atgtcaaaac aatgtgtaag atggcatcga acacatgcag aaactgttta gggaacatgc 4440 
tgaaaatatg aagtaaaatt aaaattggaa agaaagacaa tttgcctaaa gcagctcaga 4500 
gctggagaag ggaccgaggc agagataaca gcaacgtgtg gacatacgga tctggggcag 4560 
agcagtcacg gactcagccg gaaagggtgg ggcagcctct gaaggaagtt aaggtaaata 4620 
gagccacaag gtgattggcc caggagtggt gccaccttca cctcctgcct caaagtctga 4680 
aggaatgatc ctggagtctc ccatctattg atatatgaaa ttcacagtat gttttagaac 4740 
ccactgaatg atgggtagat taactaaaag aaatttaagc ggggtggtgc aggtctttta 4800 
atcccagcac ttgggaggca gaggcaggtg gatctctgtg agttcgaggc cagcctggtt 4860 
ccaggacagc cagagataca tagagaaacc ctgactcgaa aaaacaaaat taaaagctca 4920 
tcaaaacaac aacaacaaca aaaaaaacaa aaaaacaaaa caaaacaaca aaacacccta 4980 
tagtacctgt tggtgagttt gagtgagtga gtgagtgtgt gttagagaga ggggcgggga 5040 
aagtgtgttc tggaaatggg agaaagagaa tgtgcatgtg tgtttctggg atgtagacaa 5100 
aactacatgt cttccatcaa atgcaatgtt taattatcta tgagttgaac catcttcatt 5160 
ctgctaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5220 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaac aaaaataaac 5280 
caaaccagta aacaaaatcc tgtaagataa agcctaagac aagacacttc ctggggctgg 5340 
ggagttgctt agaccataag gagttcataa tccaggcgtg agagcccgag ttcaggtccc 5400 
tgggctt cca agtcaggagg agaccaagga atcaacaagt ctcgactttg gtctctagtc 5460 
ccatgcacac acacgcgtgt aaatacgtag atgttcactc acacacagaa gactgcacct 5520 
ggctctctca catctcagcc aacatataaa gcctgcatta tcagaacatt ctaggttcta 5580 
gtttcagtca actcttacac agaatggcca tcatactccg tctacaactt ctcctgatct 5640 
acccacgtgt cattgcttca gtattaacaa aacccagaat aaccagctgc gtagatcctc 5700 
cctgatgccc cagtcattgt cttactgaga ctactaagtc acaaggtagc actctggatc 5760 
caaaaagcaa tatccaattg agagttacaa cctataagga ggagtttacc ttcattatag 5820 
ggcactggat tcccaatctt taatccaacg tcttcagcag atttcataac ttccaagtcc 5880 
atcaaaacaa ctactttcct acaaagacag acacaagtta gaattaagaa ctctgcagcc 5940 
tttcagatga gttactaaga agcttacttt agtagttgtc tggctaaaac tgtatccttt 6000 
accaaccttt tctcattctg gactaacttg agaagtatta attcctaagt aaatacttca 6060 
cttattcttt ccccacatct ccaatgtttt tgtctttaat ttattatagg gcaattcatt 6120 


70 



WO 02/29059 PCT/US01/31488 

tcctatctag ttccctgatt aaaacagtag accttgctgc atgccattat cctcatggag 6180 
gcactgatac aatttagatt attaaataca aaaccctaaa acacaaaaag atgatttttt 6240 
tttaaaacaa gattttaaaa aaagcatgtg ctacgcttcc ttctgccact aagcctacac 6300 
atggtcctct gactgaattt ttcccctcat tctgcttcat ctaatatgtg cttttcaaac 6360 
ctggaattga accagggact tattcatgct aggcaaatgc tctaccatag agctataccc 6420 
ctccaactcc catctcaaat atcatttcca aagacatttt cttggtctct tatttagatc 6480 
aggtttcttt gtcctcctgc agctatgact tcattccttc agaacactcg tcttagcttt 6540 
aagttctgta ttaattagtg attgttttca ttctctctgc tagaatgcac tttcaataaa 6600 
ggcaggtagc cagccacagt gcttaattaa gcaacagccc aacgatgtca ttcactacat 6660 
actgggacaa gatgcctaac atcatctgca gataaagacg aactactggt gtcaggagac 6720 
agctaagggg tccagggctt gggcacgctg agtgtgagca ctggagtccg ggtgcccaga 6780 
aacgcacata aatgcaatat ggatgtggca atctacctct aattccttct ttaagacagt 6840 
ggctctccag agcaagctgg ctagcaagac aagccatatc agtgagctct gggcttgacc 6900 
aagaccctgc ctccaggtgt aactcccaag caaaaggatg atggctcaca aatctcaggc 6960 
tatcatgttc atgtacaaaa tgtcaaccgg catacacaca tgcacacaca tgaaaactgg 7020 
gagaaaataa gaagaattgc aaccaaaaaa tgtaatttga ggacacataa ttgcaggcgg 7080 
ggagtggggg gatgacagaa ggtgaactga gtggaccgag ggaaagctgt gctagcggca 7140 
atgagaagaa gggtggggca gtctgagcaa gggttcagca atcaccacgc tttactgtct 7200 
gcacagcctg gctgtagaat gctgggcttt atcacacaga attattcagt atgtgctatc 7260 
tttacagtaa agttattcta tcaggctatg ctacttcaat agaacaagcc tgaaaaagtg 7320 
gtctgctgct gagaacctga caaagatgac ctgttagaac tgtctgccaa gtgtggaatt 7380 
ccagcactgg ggaccaggag ctcgagggtc accccagatg cagggagtta gaggccagtc 7440 
ttggcaacat aacatcatgc ttcagaaatt aaaaacaaaa nnnnnnnnnn nnnnnnnnnn 7500 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560 
nnnnnnnnnn nnnnnnnnnn catgagatag ttaataaact gaagaaagcc atacaaggag 7620 
taaagtagat agttgcaagc atgaagaaag acaaaccact tgagcttttc ttttgtcgta 7680 
aggaggaaac cagacaggtc cagagagatg gctcagagat- taagagcact gactgctctt 7740 
ccgaaggtcc tgagttcaaa tcccagtaac cacatggtgg ctcacaacca tctgtacagc 7800 
tacggtgtac tcatatacat taaataaata aataaataaa taaataaata aatcttaaaa 7860 
gaaaaaaaaa aaaaacctaa ccaatcagcc aggcgatggt gacacatgtc tttaatccca 7920 
gcacttggga ggcagagaca ggtggatttc tgagttcgag gccagcctgt tcttcagagt 7980 
gagttccagg acagccaggg tgatacagag aaaccctgtc tcaaaaaaca aacaaacaaa 8040 
caaacaaaca aacaaaaaag gaggaagcca gacaggatgc actttatacg tgaatggaat 8100 
tgacaaaaga caagttctat aagtgttagg gaaaggggga ggacaacggg ggttcatgtc 8160 
tgtggtggaa cacgtattag aaggctctgg gtatcctgtt tccgacaaac aggcactccc 8220 
aatcacacag gccactggat gtctcaggca gagaaagatg tgatagattg actttttaac 8280 
aatcacagac tgtgtggaaa atatttgtaa ggttgtcatt gtcacccagg atagagctga 8340 
tggttattca aacgaggatg ggacaacaga aatgggagag agggatgtga gaaccatttt 8400 
gaaccagggt gatttactgc gcacgtgtat agggtctaca gggagtggga tatgtagagg 8460 
aggcctatgt tcctaacttt ggtaatgagc ttattacagt tactatgcac agcctggaag 8520 
atactggaaa aggtgcaggc taggctagaa aggtactaac tgagggtttg acagcccctt 8580 
ggatgtcagg atgcagcaag cctacctctg tatgtagtca atcccttctc aggctatggg 8640 
tcctgcagat catccgtctc tgtatccatt attcccagtc catcctctga gtggctccct 8700 
cttatccagt ttaacaaaat gctgactgca agctcccaag cccagggctc tggctccttt 8760 
actccttgtt attgtacttt accctgtttg cttgggatag agtgtgccct ttataaacat 8820 
ttgtgaaagg gggaatgaag aagaataann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8880 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8940 
nnnnnnnnag agagctcaat ggttaggagc actggatgct cttccaaagg tctccagttc 9000 
aattcccagc atcaccatgg cagctcacaa ctgtctgcaa ttccagttcc aggggattca 9060 
acactcagaa acataagtgt aggcaatcta cgtaacataa aaataaataa atgagctgga 9120 
aaagaaaaca tgtttcaaaa tatacaagta atggggctgg aggagatgtc tcaatgggta 9180 
agatcattgg ctgctctttt ggaggttctg ggttcaattc ccaccaccca catgacagct 9240 
cacaactgtc tgtaactttg gtcctgtggg agctgatgcc ctcttctggt gtgcagacat 9300 
acatgtagac aaaacacctg catacataaa ataagttttt aaaaaagtta cacatacacc 9360 
cgtgtgtaat ataacacaca ctggcttaac ttcctcagca ctgactgttc accatacgga 9420 
ttcccatgag gttttggttg cattctatca ccgaaaaaaa aaaaaaaaaa ttagaagaaa 9480 
gtatatacat ataaacctct ccctaaaata aagttttctt ttctaaaagt acatccttat 9540 
ttttttattt tttttttttt ttaagaaatg ggaacaacag ttctgctcac actgtatttc 9600 
tagcatgtaa catcttgcaa gtacttaacc gtattctata tcagctcaac acacttacta 9660 
ccgaagactc aagatcacaa aaaaaaaaaa aggacccaga ctggataatt aaacgtttct 9720 
tttgttgtag taagcgacct cttccttaga agatactaca gtaatgctga agaaatgaca 9780 
catctactgt aatctgttct ctgggattcc aacttgtttc ctctgctact cctcccttgg 9840 
cggcaatgtt cgtctgcatc cggctgagct cctcgctgcc ttgttaaacc tccttcctga 9900 
acttccgacc tgtagttccc gctctacagt gcaagcgagt ggataaggaa gcgcatacct 9960 
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atgtatgtag 

catatccagc 

atggatgcca 

aatagggatt 

tctcactaga 

gtagtgtgac 

agaactgttt 

gctagaccca 

gctctcagct 

aactgtaagc 

cttcacagaa 

taggaggaat 

cgtcctaacc 

ctcagatcca 

aattaaatgt 

aagagtactg 

tcacaaccat 

atacccacat 

tgctttggtc 

acaagtgagg 

gcttgtcgaa 

nnnnnnnnnn 

tacactgtac 

gaactcagac 

aaataattct 

acttataagt 

tgttttgttt 

ccaggctggt 

tgtgtgcacc 


acaggtacca 

tgcttatatc 

acgtaaactt 

tcctcgtacc 

gtttcccatc 

aaaatggctt 

aatatatgta 

ttgcctaagt 

catattctgg 

atccaggcgc 

nnnnnnnnnn 

caaagatcct 

tccccctgag 

tgagacttcc 

tcagagcctt 

tgtttcataa 

ttagggagag 

gccacttact 

agtcttgcac 

tgtataaaca 

gtagaatttt 

tttatgccaa 

ctgatccagt 

ctgttgagta 

ggttcaagac 

ttgaccttta 

taatgtaaaa- 

ctctttttca 

tctgtttctc 

taggccttat 

accatgttca 

tctaagggaa 

aattgctgtg 

tcctcctcct 

tttttgaggc 

accaggctag 

aacacttgct 

attactaaat 

ttttttctac 

taggaaggag 

cgagaatacc 

agaaagatta 

gtctttcaag 

actactctag 

aagcctccaa 

atagtaacct 

ggccttgtta 

atgtgagaac 

cctgcaccat 

tgtacttata 

actgctcttc 

ctgtaatggg 

acattaaata 

atggtgtctg 

aaaggtgttg 

gcgnnnnnnn 

nnnnnnnnnn 

tgtcttcaga 

ctttggaaga 

tactagtaac 

acatgattat 

tgttcttttg 

cttgaactca 

accacctctg 


tttacacctc 
tgtatgtgtc 
ggatagtagt 
ttttcccctg 
ctctctcttt 
tgccgtgtta 
tatattacaa 
atattgca cc 
aactggaatt 
cannnnnnnn 
nnnnnnnnnn 
cagtttcgac 
atttactagt 
tagctatata 
tgtcatgtca 
aaagtgtagt 
ctgacaactt 
tgggctttct 
atttcctgtt 
ttttctccca 
aaaattgctg 
tctacttgac 
atttgcacat 
catatatatt 
agttgccctg 
tcaaattttt 
tttttataaa 
actcttagaa 
tgttcttaag 
atacaaaagg 
aacatatggg 
tatcttcact 
tttgttgatg 
cttcttcctc 
atgatttcac 
cctctgtctc 
ttgagatggt 
tgttacttcc 
tttgcaagtt 
agagaggagg 
cagtaggctc 
ggaagcatgt 
ggagggggcg 
tgtcaagcct 
ttaaatgctt 
gtggtgattt 
agaggaagtg 
cactcttctc 
gtctgcctgg 
agaattgttt 
cagaggtcct 
atctgatgcc 
aataaataaa 
ttcacagcag 
tgtgacactc 
nnnnnnnnnn 
nnnnnnnnnn 
cacaccagaa 
gcaatcagtg 
atggaacaat 
gcacatttat 
aggcccggtt 
atgatctgcc 
gctatcactg 


cgtgttcggt 

tgtatatttg 

gggttgatgg 

tttgtttcta 

tgctgatggt 

tttcaatatg 

gatagattaa 

ttgtgtgtct 

gcaggtggtt 

nnnnnnnnnn 

nnnnnnnnnn 

tttgattacc 

ggaagaaagg 

attaggccga 

agtgtatttc 

tgggctcttg 

tacagtatta 

tttatgccct 

agatttattt 

attacatacc 

tatgaatgac 

atgtttaggt 

cttactttct 

taaggtattt 

tgttcctcaa 

aaaactatct 

aatagttata 

tcagagtatg 

tcacttatat 

cacccatcac 

cctttgaggt 

tctagcagat 

ctgctcttcc 

ctcctgcttc 

catgtagccc 

ctgagtgctc 

tagaggaaaa 

agtcaaactt 

ggctccatta 

gaaggagaga 

atatatttaa 

cttaggggct 

gtctgtctct 

gtgtgcttcc 

tcttttatag 

taatatgcct 

tgtctctgtg 

ctattggcct 

aagctgcctt 

ttggggggct 

gagttcaatt 

accttctggt 

taaataaata 

taaaacccta 

tggatctctg 

nnnnnnnnnn 

nnnnnnnnnn 

gagggtgtca 

ctcttaactg 

caagttttat 

catatcgtgc 

tctgtgttgt 

tgcctctgcc 

ctgtcttttt 


17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18540 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 
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ttttttttta 

catggtatgc 

tgggttctag 

gccatcctgc 

caaacaaaaa 

aagtcctgaa 

aagccttcct 

tttctcctct 

accctgccta 

gacactgtcc 

cgtttccaga 

gataaaaagt 

ggaggcagag 

aggacagcca 

aacaaacaaa 

ccaaaaacca 

agaatttcac 

aggcattcat 

tgcttagtca 

ggagtggcct 

tcacatccaa 

acacaccttt 

ttgaaagtga 

agaggaaaga 

accagggcat 

agagaatgag 

gcattttagt 

agcagcaaca 

ttactcagca 

ggctagacgc 

gacaacagtt 

ccagcgtcat 

caacttttcc 

gggaaggttc 

gtcactgtta 

tgcagatcaa 

ttataaaatt 

gttggtttgc 

ggtggcagat 

gaggacagcc 

gtcttgaaaa 

agctttcaca 

tattcattct 

catggataaa 

acattctact 

ttgtgtctat 

tgctccatcc 

atgtatgtat 

acacttggga 

nnnnnnnnnn 

nnnnnnnnnn 

gaaccgtgtg 

tagtttaaat 

accatgtgga 

gagtgacaga 

ctcttaatag 

tttattttgt 

tcccctcaaa 

accacatgct 

gccttagaga 

aatgcacact 

ggttgtgagc 

ggtgctctta 

gtaaggtggg 


acagttattt 

atgtggaggt 

gaaccaagct 

tggccctact 

ataaacctca 

cgaatgtgtt 

ctggtcctgt 

ggatgactgt 

cttaagctgt 

atcagtgaga 

accaagcctt 

tttcaaggag 

gcaggcaaat 

gggctataca 

caaacaaaca 

aaaaacaaca 

tgtttaaaaa 

gtgtgtagtg 

ttgagtagca 

tgttgcagga 

agacattcta 

aatcccaaac 

cgtttaatta 

gaggcagcat 

ttgtacagag 

aaggagccag 

cagaaactga 

gctgagttaa 

gcaaatctca 

ttccagggct 

aatacagaag 

gcctgtctgc 

agagatggtc 

caagagactt 

tttagtataa 

cataaagatg 

cctgatttga 

cagaaagatg 

gcctttaatc 

tggtctacag 

acaaaacaaa 

actacaccaa 

agatcaggtc 

ctgttgtttt 

gtaactttcc 

aaaggtatag 

acccaaataa 

atacatacat 

agttgatgag 

nnnnnnnnnn 

nnnnnnnnnn 

cctctgcctc 

gtttcttatt 

tgcatggtgc 

gggttgtgag 

ctgagcctta 

tttgagacag 

gtaaagatcc 

cagcacagcc 

taaataaatt 

gtagctgtct 

caccatgtgg 

cccactgagc 

atcatctcag 


atttcgtgca 

cagaggacaa 

caggttgtta 

ataggtcctt 

aaaaacaaaa 

ctgttgacat 

gaggatgctg 

aaccccagat 

acacaatgaa 

gcacagccta 

gctggataga 

ggccaggtgt 

ttctgagttc 

gagaaaccct 

aaaagccaaa 

acaacaaaaa 

aatggctgaa 

gtttaaataa 

gtactagaga 

attgtgtcac 

ggtctatgtg 

aatgaaggta 

agagtgatga 

aagagggagc 

acaggttgca 

gagattagga 

gagaagtcaa 

actagccaac 

gaggctaaaa 

aggcctaggt 

actatgtaca 

attgttagga 

cactgttttt 

catcccagga 

tgactcctga 

aactttcaca 

tttaaataat 

taataacgtc 

ctagcacttc 

agtgagttcc 

aagaaaagta 

aaagagacat 

caaggatgaa 

gagcttaaag 

tttttataaa 

aaggacagag 

atatgtgtgt 

gtgtaggtag 

acaagtgaga 

nnnnnnnnnn 

nngaattcac 

taaagtgctg 

tttttgttta 

ccttagagtc 

ctgggacttg 

tctccaggcc 

ggtgtgtagc 

tcctgcctct 

atgattcata 

attttttttt 

tcagaccccc 

ttgctgggat 

catctcacca 

tctccgggaa 


tgcatgtgtg 

ctttcaggag 

gacttgcatg 

atataaaaag 

ggaccatata 

gtctgtactt 

atagtctgtc 

tgttcctctt 

tgagctgagt 

gcctaacagt 

gcatatttgt 

ggtggcacac 

gatgccagcc 

gtctcaaaaa 

aatccaaccc 

gtttttgagg 

taatatttca 

aaatagcccc 

gggaattgaa 

tttgaggtcc 

atctggctgg 

aagttagttt 

attagagaaa 

atggcagagg 

gagctagaac 

cagattgcca 

attgaatcag- 

agaatccaga 

acatcttaga 

tagcagacgg 

gacatggata 

gtcatctagg 

tgcatggctt 

ttgctgctta 

gctttagccc 

aattaatgct 

tttaggaaga 

agaatcaaga 

ggaggcagag 

aggacagcca 

agtaaaggct 

gcttgggaca 

gccacaagtg 

aataaaacac 

ttttatctat 

agaaagaaat 

gtgtgtatgt 

gtatatgtgt 

ggttgggccc 

nnnnnnnnnn 

tctgtaaacc 

ggattaaagc 

tgggtctttt 

cagaagaggg 

aacctaggac 

gtcccatgga 

tcatgcttga 

gtctacagct 

acaataaaaa 

taaagattta 

cagaagaggg 

ttgaacttcg 

gccccgagat 

tcttgcctgt 


tataagcatg 

ttagttcttt 

gcaagtgcct 

atcatatgcc 

atattgtggg 

cagacccatg 

taaaaactag 

cagagactgt 

ttccaggtta 

acacatgtct 

ctgtttggct 

gcctttagtc 

tggtctacaa 

accaaaaaaa 

cccccaaaaa 

tttaatttat 

ctatccattc 

cataggcttc 

ggtgtggcct 

cagcaacaag 

aattcagaca 

ataaaaagaa 

gatctgctgt 

gagagggagg 

aggtgaagac 

atgttaatag 

ttagcttgga 

aagaactaga 

cctagattag 

agagagtaat 

tgaacctctc 

aaaggctaag 

ttgagaggcg 

attacgacat 

atcctattgg 

gtttagatga 

aagctttagg 

atagaatgtg 

ataggcggat 

gggctacaca 

gcataataaa 

aatttgtgat 

tgtgatatga 

tgctttgaaa 

gagataattt 

aaggtgtggc 

atgtatgtat 

atgtatataa 

ccnnnnnnnn 

nnnnnnnnnn 

atgctggcct 

atgtaccacc 

acctgtatgt 

tatcagatcc 

ttctaaaaga 

tttggggggc 

atttactatg 

gctaggattc 

aagaaagaga 

tttatttatt 

agtcagatct 

gaccttcgga 

aaataaatta 

tactccttcg 


taacgtatgc 

cctcccactg 

ttaccacaga 

gggcaaaaac 

ggagtggatg 

ggaattggca 

agatcacagc 

ccaccaagct 

cagcacagta 

gctttcttca 

tatttcactt 

ccagcactcg 

agtgagttcc 

accaaaacaa 

aaaaaccaaa 

tgcatgtcac 

acgtatttgt 

tacagttgaa 

tattggagta 

gttgctctga 

tgcccttaat 

gcacccatgt 

cacagagcag 

aggggttttc 

agaacaagcc 

gctaagcaga 

aaggagtttg 

aaagatgagc 

actgcatgga 

aagccttgga 

agccacttct 

ggcaggcaag 

agctctgaga 

gccttttctt 

gcatatttcc 

ataaatgatt 

agatagttta 

gctgggcagt 

ttctgagttc 

gagaaaccct 

gaatacaatg 

caaggaaaaa 

tgaacaagac 

ttaactatca 

tctaaagaac 

atctgggctc 

gtttatctat 

gtatgcatga 

nnnnnnnnnn 

nnnnnnnnnn 

tgaactcaga 

acaacccagc 

atgtgtgtgc 

cctggaactg 

gcagcaggtg 

tttgtttcat 

aagccctgac 

gaggtcttgt 

gacctaaatg 

tattacatgt 

cattacagat 

agagcagtcg 

taatgtatgc 

ctctcccttc 


21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 
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tattcatgct 

gtggcaggga 

ctgggtggaa 

aattttattt 

ggacctgcca 

tttacttcca 

aaatagtgac 

cccgggaagc 

tggtcagctg 

gctgagtacc 

caaaggatac 

aggaggcaga 

gatgggatta 

aaccccaccc 

gagaaagtcc 

gccctcgaag 

cctcccaccc 

tagaacttgc 

gggagcaggt 

tctctgacag 

ttccatctca 

tgccccagct 

cagttgtcaa 

atcccaacac 

gatcaccgca 

aattaccgtg 

aatagctgcc 

tgtcagagct 

tccccataag 

aaccagtaag 

caccaaacca 

gtcttttccc 

ccccacccct 

ttttgtagac 

cccaaatgca 

tctagctgca 

tggacacgca 

tgggtgggta 

nnnnnnnnnn 

nnnnnnnnnn 

gtttggttct 

ctggccccac 

cactactctg 

gctaacaagc 

tctcccacag 

taaaataact 

gagggtggtg 

gaactgcccc 

gacaggtgtg 

tgacaggaca 

aaaagccatc 

gtgtcctggg 

agtcagtgag 

tttttttttt 

agacagctcc 

tgctgggatt 

tctctccagc 

gaggagagtc 

ccataactac 

ccctctggcc 

aagagggcag 

aaaggagatg 

ccctttggct 

cggcgctcac 


tgggtaactg 
ccactgcgcc 
gtagagggct 
aaattttata 
acggttgtca 
tactttctac 
agatgacttt 
ccaggctaaa 
agggaccctg 
caaagtgctc 
aggcatgagg 
gctccaagag 
gaaaggcatg 
ctgggtgtcc 
caagaccagg 
actagcttgc 
tcgtctcttc 
cagtgctcac 
atgcctcaga 
ttcctctcct 
aaaaacaaac 
tggctgggaa 
ttaattgtaa 
aaagaaatga 
cattgtatac 
cagtgattca 
cttgaccccc 
tcctgtggga 
aggagtcacc 
atgtgagcag 
aaaacaaaca 
acataaagtt 
cccc gagaca 
caggctggcc 
gggattaaaa 
tacgtatcaa 
aactgtatat 
gggaagtggg 
nnnnnnnnnn 
nnnggcagga 
gagcacctga 
tgcccactcc 
ctcctctccc 
tgggcagtgt 
tgagacactc 
agggggactt 
acagctcagg 
tggacaacct 
accccagtac 
aggttggagc 
atggcctgag 
gacctcttga 
gtcaagggaa 
tttttaagat 
agaagagggc 
tgaactcagg 
cccctctcag 
atgctgaagt 
tgactccccc 
ccacccattc 
gcaggggcag 
aagaaaaggc 
cttcccccta 
cggagagaac 


gccctggctg 

cacagggctc 

gcatatataa 

cagctctata 

agggtaatgg 

attaccccac 

gaaggaaaat 

tgtaatctgg 

gtgaccctct 

ctctgggctc 

actgaggcct 

tccaaccaag 

tttgcaaata 

cttccctgcc 

ctggctggag 

tcgcccacac 

ctccaacctc 

tgtaatgtcg 

tctccctcta 

cctgaatcct 

actgggaagg 

ggggtaa c tt 

atttcaaaat 

taaacattca 

atgtttttga 

agataaaaac 

ccacccccac 

gggtggtttt 

cctgtcccac 

ggatgatacc 

aacaaacaaa 

aagcacacaa 

atgtttctct 

tggaactcac 

gcatgagcca 

aagatgacct 

gcctcagtac 

ggggagggta 

nnnnnnnnnn 

tcctgtgttc 

aggggacicaa 

caccctctgg 

agatccacgg 

cacctccctc 

atgtccactg 

gtcaatcact 

gacctgcatt 

ctgcagcaac 

ctagtgccag 

tggtagatgt 

ccagcctgcg 

gggtagcatg 

cagctctagg 

ttatttattt 

atcagatttc 

accttcggaa 

tcctgatgcg 

ccttcctacc 

acccccattc 

ttccccagcc 

cttctgctgg 

ccttcccatg 

aacctctcac 

tgttgagaaa 


attgatgaga 

cctcaggatc 

cacgatctca 

taaacgaagg 

cgatgtagtg 

aactggcaag 

tgaatcggta 

gtcaggcctc 

tgggccagca 

caagggcctg 

gacgaaccca 

aggcaggaag 

ctttcaaatt 

tcttgccaca 

ctcctgatag 

cgccagatgt 

agtggagggt 

tccctgtagg 

ttcgctgccc 

gcttgttggc 

tgttgagaga 

acaatgctct 

agctagtaga 

aggcgatggg 

aatgtcaggc 

tataatttta 

aaggtccttc- 

ggtgtacaca 

gatgccatgc 

tgtctcttat 

caaacaaaaa 

agtagccacc 

gtataacagc 

agagacacag 

cgaactaacc 

agttggccat 

aggggaaczgc 

tggnnnnnnn 

nnnnnnnnnn 

atgtgcaaca 

gcaggctgat 

catccccatg 

agccataggg 

aaggttcctt 

gaagatattg 

acactcttag 

ctaccaaata 

tgaaccctct 

gtgagagagt 

ttgggccacc 

agtctcctct 

gtcaccaaaa 

ttctctgtgt 

attatatgta 

gttacggatg 

gagcagtcgg 

acagggcagc 

ccaccccttc 

ccttgcttcc 

ccacctgatt 

gcctgctcac 

ggtccccatc 

ttttgagtac 

agggag aac a 


gctgatttcc 

ctcagtacag 

actttatttc 

aactattgaa 

attttttttc 

tattatttta 

aaaagaaagc 

ccaggcctgg 

gggacgggga 

cacagagact 

gctatcattc 

ctttgggacc 

tacgatgcac 

cccaatagct 

gttccaccct 

ctgtgtcttt 

cccctgcttc 

tgtcatggtc 

tttcaggctg 

atgcgaacag 

cagagagcat 

actgcccagt 

gaggatttta 

tatgctaatt 

tgtaccccat 

aaaagctaaa 

ctgtttgtcc 

gacactcctt 

aggg ccac at 

aacaaacgga 

cagggttggt 

atttatttat 

cctagctgtc 

agattcacct 

agtaccccag 

cactggaaag 

cagggccaaa 

nnnnnnnnnn 

nnnnnnnnnn 

ctcgatgcaa 

gcccaggcca 

atgaacatgg 

tccccagatt 

tcctgctctg 

tagccattaa 

ccccggactt 

gccatgtgtc 

gtggtctcct 

gctagggcca 

aaagagaaca 

gcagttggga 

tcctacaagg 

ccctcacgga 

agtacactgt 

gttgtgagcc 

tgctcttaac 

aaaggccttg 

cgaacccctg 

actgatccgg 

gtacctggtt 

tcactggctg 

ttgccaagac 

ctgctggccc 

gagaactcag 


ccactgccct 

agctgcacag 

tttaaataaa 

ggttcagcaa 

ccccttccat 

aaatgaaagt 

tgagagacca 

ggtctcaaga 

ggagccggaa 

gtgtgggaat 

gtcctagaac 

cgagatgggc 

actcactgga 

gacatcactg 

cctgcagagg 

ctctcttttg 

ctggggaaag 

ccccattact 

tctcagtttc 

gctcaatatc 

gggtaatggg 

agggtagctg 

gatgttccca 

gctctgatct 

aaatatgtac 

aacagaagga 

agccacttaa 

cctccctcct 

gcgtgatatt 

aaaaaaacca 

ctgtccctgt 

ttgtcccctc 

ttggaactca 

gcctctgcct 

agctcttgac 

agaggcccat 

aaaatgggaa 

nnnnnnnnnn 

nnnnnnnnnn 

gctgtgtagt 

cgggttcttt 

ccacagatca 

catctctgca 

agcagcagtg 

attcctgtgc 

ctgactcata 

cctgatggag 

agttcttctg 

cactaagggg 

ggtcagtagt 

cactcttgca 

acagatcaga 

cctttttttt 

agctgtcttc 

accatgtggt 

cactgagcca 

tcccagatct 

aacatcagcc 

tcctcctctt 

gtccaacttg 

tagaaatgag 

ataggtgagt 

gggagatcca 

cgttcctccc 


25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 
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tctccaccct 

actgaccact 

caggacaagc 

ggagtgttag 

cttgaccgca 

ctctgagagg 

tttcgtctta 

gaccttgcgc 

agttcttcct 

gcatgatgcc 

ctttctcagg 

cagccagaaa 

gaactacaag 

ccagctggcg 

cctacccagg 

cctccacaac 

tttttctcgc 

cctggacctc 

ttcatccaga 

gctgcaccac 

gtgttctcct 

ctctcctccc 

gtaaatattg 

atcaaggcta 

gaggaggttg 

cagtgaggct 

cagccaaagt 

caccaccacc 

caccaccacc 

nnnnnnnnnn 

nnnnnnnnnn 

gaacatctct 

gtgtgatcct 

tgcctacaca 

ccctttagct 

ggctcttagc 

ctcccatttc 

cttggaccac 

ccccctggag 

ctgtcctagt 

ccatcaccat 

ttctgacttc 

ggtccttaaa 

tcgttccttg 

taatccctca 

cagggg c a c t 

gcagatttag 

tggctgagcc 

tctgggtagg 

agtaaatgag 

ggtggtggga 

tgaggtggga 

caagcaggga 

ggaaatggag 

atgcaaggcc 

ccaggacagt 

gggggaggag 

tcccatgtgc 

tgagaagaca 

tgttgctcct 

gacagtgcag 

ggctggctgg 

gaaatcttgg 

gaaaggtcag 


tctggcctct 

tcccactcag 

tgagtacgag 

ttcctacctc 

ccaggcfcgcg 

tgggatttcc 

cactgtggat 

atctcccttc 

gctcccatgt 

tggcccccac 

ctgccctggt 

cttctttgtt 

cctcatccca 

gaagggggga 

ccttatttca 

tgtgttcaag 

caagagtagg 

tgctcctcac 

ccccttgcca 

ccctccatcc 

ggaccagctt 

acaggacctt 

acctgtacat 

cccagaaccc 

ggcctggcaa 

cagagaggct 

cccaccacca 

accaccacca 

accaccacca 

nnnnnnnnnn 

nnnnnnnnnn 

ccttttttcc 

tctgcctctg 

atgtctfcagc 

ggcttctctt 

cccagagcct 

cctctatctc 

aagacaaaac 

tcaggtctag 

gtgccctcac 

cctctcatgt 

ccagaaaaat 

attaagtcag 

ttcactctgc 

tctatagcag 

gccactcagc 

gcttcttgtt 

tcaggaggct 

tagcaactga 

atacccttgc 

gtctaagtca 

aatctgatgt 

caggtgctag 

actctaaagg 

ctggctgcaa 

gatgccatcc 

ggtccgggga 

cccaaatcag 

agtgggagcc 

cctcctccca 

gggaggacac 

ctcatggcta 

aatctgaatc 

ccctgcagaa 


cccagatttg 

acctcagctc 

gagcccccgg 

acgctcgaag 

gaacctggac 

ataagctttt 

cagtcctggg 

atgcctttgg 

agatgttgag 

cctagcaaag 

cctacccaaa 

gaccaatgaa 

cttctgctcc 

ggctctcagc 

ggcaccagct 

aaccttcagc 

cctccaggtt 

caccccagga 

aggtgccagc 

tacctcttca 

gctccccacg 

tgctctcagc 

ccggttaaac 

tgcctgagcc 

gccttctaat 

gtgtggcctg 

ccaccaccac 

ccaccaccac 

ccaccaccac 

nnnnnnnnnn 

nnnnnnnnnn 

tttttgagac 

cctcccaagt 

agcccttagc 

gtgaagggct 

tccttccttc 

cctaagcaac 

tgcagcctgg 

gggaggaaga 

ccatcctata 

acacgtgccc 

atctgatctg 

acaatccatg 

ttttggcagc 

gtctcccggt 

tgctcaggga 

tcccttccca 

gatttatcag 

gacaggagga 

agatgggacc 

cagatctttg 

gcatggggaa 

ggggatgtct 

gcagaaagtg 

taactgggtt 

tgtccaaagg 

gaggggtcaa 

gaggcacaga 

agtacatgtg 

gagccaagct 

ctctctctcc 

cgccgctcac 

agcctgagat 

cagaactctt 


cccccgcccc 
tgcctcaccg 
agcagtgcca 
gccaagcagt 
tcaaaacata 
tttttcacct 
ttcaaattta 
ctttaccctg 
gacccaaata 
ccacctgacc 
ggctctgaga 
tgactgg ccc 
aagttctgat 
ctagagagcc 
cttctaaaag 
cagggcctca 
ttggagttct 
cgctgtgaag 
tgtctacttc 
ggcttcttag 
ctcccccagt 
caacccccgt 
attgatatgg 
tggagaaggg 
gatccctcaa 
tgtaagggcg 
caccaccacc 
caccaccact 
cacctcatct 
nnnnnnnnnn 
nnnnnnnnnn 
aatcttacta 
gctgaggtta 
agcaccaggg 
atgtcttctt 
aggttaaaca 
gaccttttct 
gctgtgtgtc 
cagggttcac 
gcacgcacat 
tccctcgcca 
agaagttagg 
ggacatgaag 
accaccagca 
gggaatttta 
gagacccctt 
gcccct ccca 
agaggtgctc 
gatggtcacc 
cctgaagttc 
ttaccacgtg 
acacagaggt 
gaatgttcca 
agggtgtgca 
ggggaggcag 
gcagtgtcca 
gattctcccc 
aactgggatg 
tttcagatta 
gccttcaagc 
actggctcaa 
cccctggaca 
accccataat 
tggtccccac 


ccagcatctc 

tgaaacaggg 

tgttcctgta 

agactgctat 

gcagctgtgg 

gtacatttag 

aagccctcat 

tcttggtaat 

agaatctctg 

tgttgttcat 

gctaatctgg 

agacaccttt 

ccagggtgct 

ttcctttcca 

gtccttctgt 

tctccaatct 

agaggtttct 

ctgcaggctc 

ctctgctgcc 

cgcagcacac 

gcagccagca 

tcagcttgtg 

gggccagaag 

gtttacagga 

cataggggat 

cagagtgggc 

accaccacca 

accaccacca 

acccatacta 

nnnnnnnnnn 

tctatgaggg 

tatttaggct 

caggcatgca 

gtcaggaagc 

ccccttccta 

gcaccagttt 

gctctgactc 

ctcgcacatc 

gactcagaaa 

gctgatgtgc 

gacacatgca 

agtctgccat 

ggcaacaacg 

ggaaccaacc 

gggacctctg 

agaacaacaa 

tcacaggcaa 

agagaggcac 

ctgggcatcc 

ctcccggggg 

gttagactga 

ccaatgctgg 

ccccaagcca 

gcaggggctg 

tcattggcta 

agccagattt 

ctctgagtca 

ttgtggtctc 

aacccagtcg 

cacatggcag 

ggacagtttc 

gtttggggtt 

tgtacctccc 

ccatccccct 


cttcagcctg 

accttgcagg 

tccagaacag 

ccatgggttc 

acctcactca 

tcttcattct 

cttgcaagag 

tcatggcaga 

taaatactga 

ttcatccagc 

gctggcaggg 

ggacttacgg 

tcggggaagc 

tcctcagccc 

tatccctaga 

ggatatatga 

cctggagctg 

cctgaataaa 

caagcagcag 

gcagcacacg 

gggcctagct 

ttcagtgctg 

accctttccc 

gcagataagt 

tatccacagt 

tccagagtca 

ctaccaccac 

ccaccaccac 

anttgaggct 

nnnnnnnnnn 

ttacatttta 

ccccttgaac 

cagtcacatc 

cctcaactgt 

gcatggagac 

ttggtgggac 

tcatctggca 

attcctgtgc 

agaccactgg 

cccctccgct 

tcactaactt 

catcagctat 

agaagactcc 

tggctctccc 

tgttctcatc 

agaaatcaat 

cagcctccct 

ctctggtccc 

tctaccagga 

cgggggtggt 

ggactgaatc 

ccaagagcta 

caggaataac 

cacaggacac 

gccaggggca 

ctaggctcca 

aggttggcct 

acatccaagc 

gagacaaaca 

tgaatatgcg 

aaggggttca 

tttccctcct 

aacaccccca 

cagccctgga 


29220 

29280 

29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 

31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 
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ggctgaactg 

ttcctttgac 

agggcagc ca 

atgacctcat 

aacctagcag 

tctgaaaagc 

tcactgctgg 

ggtactgctg 

cctttcccaa 

ctaatagagt 

ggttcctaaa 

tgctctctgg 

tgtcttcccc 

gcctgcactc 

ccagcaggct 

tgtgattgca 

tccaagacca 

tcttcattgg 

ggggtaggtc 

tggagagact 

gctcactgca 

tgggactgta 

tggcccagtg 

cacatagcaa 

ctnnnnnnnn 

nnnnnnnnnn 

catgccttta 

cattgtctac 

ataaataggg 

ccccagcaac 

tcaaggtgga 

agtatgctcc 

ggtaacatgg 

ttcagtggcc 

tttatgctgt 

aataatttaa 

acatgtgtgc 

gggttgttag 

ttaatcactg 

ggctcagagg 

ccacatggtg 

agacagctgc 

ttgttttttt 

tgaaactcgc 

ccaagtgctg 

caccaccacc 

gaaaatgaca 

tgcataaaca 

ggtttggatt 

agtagaactg 

ccaaggggaa 

ggctgcttca 

gctgttgatt 

tgttagtaat 

tcctcttcct 

gcagagcccg 

aacagcttca 

ggctacactg 

tctgtagagc 

gggcccccgt 

gcgggaaggt 

cattccactc 

tcttgctgaa 

attaacacca 


atgggcagct 
tcagaccact 
gaccccacaa 
ttgtctccca 
gcctcaggca 
agcgatccga 
tggaagcaac 
agtccagccc 
caccattcaa 
cctgggcctc 
ctgcaggggc 
ggtgcccctg 
aggctagacc 
tgaggggctc 
tcagattaca 
tcattgtctg 
cacgttggcc 
gaccttcatt 
atgagtctga 
gtgagccagc 
ggtgggagca 
cctgacagct 
cttatgagca 
ggactcgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
atcctagaac 
atagcaagtt 
ccagttggca 
catgtaaaaa 
gacagacgga 
cagccagtga 
tggctcagtg 
actaaaggtg 
ggtacacaca 
aatgttatgt 
aatgcctgtg 
ctaccatgtg 
agccatctct 
ttaagagtat 
gctcacaacc 
agtgtactca 
tgtttttcga 
tctgtagacc 
ggattcaggg 
ccaccaccac 
cctagcattg 
cacataaata 
gtgt cccccc 
ttagggttta 
tctctggctt 
ccacatgccc 
tccccagata 
tgtgtgtgga 
catccacctg 
ttcatgttac 
aacaggagaa 
aagctaccgg 
actgagacat 
gtccgtgtcc 
gaggcggacg 
gaaagcagcc 
gtcgggtgat 
cacacacaca 


aaggtccaga 

ccctgccgtg 

accagctagg 

gctcttttgg 

gcccactgct 

atggagtcag 

ttcaggaaga 

cagagcttag 

aaccccagtc 

tgtcccccca 

tactataacc 

ccactcccag 

agg c gctg cc 

caggaagcag 

gctactcttt 

gtgttaactt 

accctcctct 

gggacatgat 

acttgaacat 

cggtatggag 

aggtgattct 

ctgaaggttc 

ctggctgctc 

ctgcttggaa 

nnnnnnnnnn 

nnnnnnnnnn 

tcaggaggca 

ctaggctagc 

aaatttagct 

gaagcaaggt 

tccatggggc 

gagactgaaa 

agaaaagata 

gaaggagaga 

cacgcacaca 

gtatgggtgt 

aaggctagaa 

gattctggga 

ccatcccctc 

tgactgctct 

atctgtaatg 

tatacataaa 

gacagggaga 

aggctggcct 

tttgcgccac 

actacctgac 

tcctctggtc 

cataagtaaa 

aagggaaaaa 

ggcctggtgg 

catcttttcc 

agaagcaagg 

tttgtagtag 

ggtgccctcc 

cactccgaga 

aagtatgtaa 

ggaagaagca 

ccgccttcct 

taagtactac 

ctcagaacaa 

ggctggaggc 

ctgtgttcca 

tcaaacaacg 

cacacacaca 


cagtggctgg 

gcttacatca 

ctaaatggtg 

taagagagaa 

ccttgctgta 

gctgagctgc 

gcccagccta 

ctccctgctt 

cagcctctcc 

attctctctc 

ctacctccac 

tccctctagc 

ttacctgtgg 

tgaggggagt 

tcttaaagtg 

taacccactg 

cccaccacat 

ggctaacttc 

ctgaaaggat 

atgctgggtt 

tctcccctcc 

caaaggacag 

tcgcagagga 

ctccagctcc 

nnnnnnnnnn 

nnnnnnnnnn 

aagcagatgg- 

caggtacatc 

tgccctccta 

gtggtggcac 

tcactggcca 

aataaataaa 

cttgtcatac 

accaactcca 

cacacttgtt 

tttgcctaca 

gaggacatca 

acaaaaccat 

aatagtatat 

tccagaggtt 

gaatctgatg 

gtaaataaat 

cagggtttct 

tgaactcaga 

cgccaccacc 

tatttaactt 

cctacatgac 

cttagtctgg 

ttgggtccca 

gaagtggcca 

ctttgctttc 

ggagcagtca 

taagagacag 

ggggacgggg 

acaggaaatg 

attcataagg 

aacattaagg 

ggaatgtata 

tggggtccag 

agaggaaagg 

ctgggctggg 

cttgggtgag 

actgtagctc 

cacacacaca 


ctcttggaaa 

ggaggtgcaa 

cttattgttc 

tgagaggaag 

a< ?g < 3' aacca 9 

aggaagctca 

tgggactata 

cccaccaccc 

tactggtcta 

ccctctcafcc 

ttccttgcac 

cagggagcct 

ttgcggcagc 

agctgcctct 

acctgactcc 

ctgcccttcc 

ctcccttgga 

aggggcactt 

tggctgagag 

cttccaggcg 

tcacctggaa 

tggggtgggg 

cctgagttct 

agagaatctg 

nnnnnnnnnn 

nnatgtcagg 

acatctgaat 

ataagaacct 

acacaagaac 

ttgcttgtga 

gccagctagc 

taaataaggg 

aagcctagca 

aagaattgtc 

acatgcatat 

tgcatatctt 

gatcccctcg 

gggttttcca 

atttctgggg 

ctgagttcaa 

ccctcttctg 

aaaccttttt 

ctgtatagcc 

aatccgcctg 

tccaaggctg 

ttaaaggcag 

cccatgtgca 

ttgttttgga 

gagtagtact 

tggctagaga 

aggtcctaag 

tggctgggac 

gtgaggaacc 

gccctcctgg 

gtgactttgg 

accagtttct 

aaaagctctt 

atcagcttcc 

gattctgcct 

ttggttaagg 

gctgcttcct 

cttcacgggt 

tgtggagatt 

cactccctgt 


gcctgtctct 

gggctgcagg 

gcaagaggcc 

ctggacagag 

caccgatggt 

ccttccttgc 

gctcctccgg 

accaccacat 

cagtgagcgg 

tgttcacctt 

ccctcttttc 

cttccatatc 

ttctctcaca 

caaccagcgt 

atttggaatc 

gccatgtggc 

tctttatctc 

gggccagcct 

gcaggctgca 

cttggctctg 

aatgaaggaa 

actagagagt 

gttcccagct 

acgctatctg 

nnnnnnnnnn 

catggtggtg 

ttgaggcccg 

ttctcaaaaa 

ccaaagtcaa 

tccagcattg 

tggtctactt 

gttaggaaga 

accctcaatt 

tcctgacagt 

gcatacaatt 

tctgtgcacc 

gagttacaca 

aaagaggctc 

ctggagagat 

ttcccagcaa 

ctgtgtctga 

tttttttgtt 

ctggctgtcc 

cctctgcctc 

ctgctgcggc 

ccatctcatg 

aacacatacc 

aatgtgctat 

attggaagag 

gacatgccaa 

acagtcacaa 

cgctaatgca 

ccacagcaag 

ggcaggacgt 

cagagcttaa 

ctccatatga 

ttattgcaga 

ctctgggggt 

atgaagagga 

tgatagtcta 

gccccctctt 

ttgccagtaa 

cagagattcc 

ttgtgtaggc 


33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 

36000 

36060 

36120 

36180 

36240 

36300 

36360 

36420 

36480 

36540 

36600 

36660 

36720 

36780 

36840 
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tgattttcaa 

cactctgtga 

acctggcctg 

acacacacac 

taatggtgtc 

cagcaaggtg 

catgtcctgc 

cagcacagct 

gctctcttct 

catacttcct 

tgtgagccag 

gctgtaacgg 

tactggcagt 

aatgaacaca 

aagcaacttg 

actaagggga 

ttgctccccg 

tgacactgcc 

acgggtttgc 

aaatgactct 

aagtggaatg 

cattgaacca 

ctgtggacac 

nnnnnnnnnn 

nnnnnnnnnn 

gagagagaga 

attccaacac 

taagtttcag 

cacacacaca 

tggggaaaca 

gccctcactg 

gaactcagaa 

tccccccttt 

ctgggttatt 

ttcatcagtt 

aactggtgtg 

gagcagtctt 

acaataaaag 

cctcgcctga 

ctccgcatgc 

taaatacttg 

caaagttcgt 

agattttggg 

gtaatggtgg 

tgagttcgag 

aaacccctgt 

aaatgttttg 

aaagtttagg 

gtgagacttt 

agagtccagc 

aaaggcattt 

gagtgtgctg 

ctcacataca 

gggtcaaaag 

gtgccacact 

tgagaccccg 

tctgcaagat 

caaacctagg 

agatgaggag 

gcgcttctgc 

agctcgggac 

agcccatagg 

acgaaccgag 

agccttgtct 


gaaagcaagc 

caggctcagt 

agagaatgtt 

acaccaagga 

acatgttccc 

gcaggctcca 

agagacttgg 

tggatagtct 

atcatggtca 

gtcaagggca 

aaaaaacaaa 

ggccccagcc 

ccttccatga 

cgtttgtttg 

gagaggaaag 

ctcagggcag 

tggcttctta 

cattgtgggc 

ccagaggcca 

aacttgtgtc 

tcccccagag 

gaaatgaaca 

agcgactggg 

nnnnnnnnnn 

nnnnnnnnnn 

gagagagaga 

ttgagaggca 

aatagccagg 

cacacacaca 

tctgagcaga 

gaggtgtctg 

ttcagtttca 

atctaaaatg 

tgcatctaca 

tcctataccg 

gtgctacttg 

gactactcag 

aaaagaaaag 

aagcctccac 

accatggctc 

taatgattag 

gcacattgac 

atgcttcatg 

catatacctt 

atcagcctgg 

ctctaaaaat 

taggctgcct 

ggactctaca 

gcagtgggca 

tttccataaa 

gcagtctaag 

actccacaaa 

tgtccacatg 

cagcaagaga 

ctgagcacag 

cctcactatg 

gagcccatca 

caggcaggat 

ttcacatcaa 

cagcacctga 

cccagtgagg 

aagaacaatg 

gactgcacat 

gacagcatgg 


tagaagtgga 
gacctacctg 
ttgggctgtt 
ctgagtctaa 
tgagccaccc 
agactcagcc 
taagttcg cc 
tcagttctcg 
agttctgggt 
gctgggcaat 
acaaacaaac 
ttgtgct ccc 
gtagggagtt 
tggttccgct 
ggtttatttc 
gaactgaagc 
gcctgctttt 
tgggcccccc 
gtctggtggg 
aagttgacca 
actgcctgcc 
ataaaatgga 
cttcccagag 
nnnnnnnnnn 
nnnntggcct 
ggataaaagg 
gaggcagggg 
gctacacaga 
cacacacaaa 
ggaagaaatg 
cacaggagaa 
aagcactgac 
agtaagaacc 
aaagaggtct 
ctgactacac 
ctcatgatcc 
tgaatttgag 
gtggcgagtg 
acggaagggg 
gtgcaccccc 
tttctgaaga 
ctgtcgaccg 
ttgcagctac 
tgatcccagc 
tctacagagt 
cactaattta 
taacaaagcc 
agcatcctca 
aggttcagct 
acctgtgcag 
cccggtgaca 
tttgtcctct 
cacacatgca 
tgccctgtga 
gtggtactga 
tagcccaggc 
tgcccaccat 
ggtggactcc 
tctcccgtat 
ccaatgcaga 
gtgctggggg 
tcagctggcc 
gaagggatcc 
gaggggaggc 


gtacctcaca 

agaacttata 

ataggacaca 

tgggaggtgg 

tacaaagaaa 

ccgagcgcaa 

ttcgaagctc 

gtcgatgtca 

cacggtcagg 

ctaacccaac 

aaaaaaacac 

cagcctcagc 

ttcttctcag 

acggtttcta 

atctgacaat 

ggaagccgtg 

ttatgctatzc 

cacatcaatc 

ggcattttat 

cacgaatcag 

agcactgctg 

tcctttgaac 

cccccatgga 

nnnnnnnnnn 

catcaggtat 

ttagcccagt 

gagctctgtg 

gaaaccctgt 

taagatcttt 

gggtgcgcag 

cacttgcact 

aggagcagtg 

aaaaaaacaa 

tggggaggaa 

aggggctgaa 

caacacttgg 

gctagcctgg 

aggtggccca 

aaagccagct 

acacccaccc 

acaatatttt 

tgtagtacgg 

tctgatgaac 

acttgggagg 

gagttccagg 

aaaaaaaatt 

caatggccat 

ggaaggccac 

ctttatgtag 

ggcctctagg 

tgagctcaat 

gatctataca 

tatacatgcg 

aaaacgtctc 

tatcgttcct 

tggcctggaa 

gttattgaag 

tgtaatctca 

gcgttggagg 

tgcagatgct 

gaggactgga 

aaaccaccca 

atggctccag 

cattggtcct 


gtgacttgtg 

gccaagatgg 

tagagataca 

ttcttcattc 

gccacaggac 

agtggccttg 

agcttcagct 

tgaatgaact 

ggcaggaaga 

agagatgcgc 

caacagctgc 

ctgggctgta 

cctaaaaccc 

ttgtgataaa 

tcgcagggtg 

gaggaacgct 

cagaaccact 

actaatctag 

caattgagtt 

ggcctggttc 

accatttgct 

agatgtgttg 

atcannnnnn 

nnnnnnnnnn 

cagagagaga 

ggtggtggca 

ggccagtttg 

cttgaagaga 

aagaagaaaa 

gacacccacc 

cagcttgccc 

actggggacc 

aagtgtttgg 

acccagttct 

ggtaatctca 

aaggtaggtc 

gctacatgaa 

tcaggtaaaa 

tctacatgtg 

acccacccac 

cgttgatctt 

gatccgctcc 

agtgttgctt 

cagaagcatg 

atatccaagg 

cctttctaaa 

tcagagaagg 

agaaagcaga 

gaagaagaga 

caaagccctg 

ccttggaacc 

tgtatgcacg 

catgcgtgca 

attcagtctc 

gattgatcga 

ctcacagtga 

caataccatg 

ggacttgaaa 

ctggagtggc 

cacagccaac 

ggagctgaga 

gagctcccag 

atgcatatgc 

gtggaggttt 


agctatgagg 

ctgaagccag 

cacacacaca 

ccctcccctg 

tcagttctgt 

caaacatact 

tggggacagt 

cagaggtgag 

agatgatgct 

acaggttagt 

cttcccctct 

ggctactggt 

acagaagttt 

acatgactga 

tcttctcatc 

gctttctggc 

tgcccaggag 

aaaatgaccc 

tcacccttcc 

ttaggagctg 

ttgtatagag 

atcctagggc 

nnnnnnnnnn 

nnnnnnnnnn 

gagagagaga 

cataccttta 

gtttacagag 

aacacacaca 

gaaaggatag 

ctcagaggag 

tagggcgtca 

ccaggttgaa 

gatttggaat 

acccctggaa 

atgttttcat 

agaagttcaa 

aactcataaa 

gtgccaacca 

gtcctctgcc 

ccacatgaca 

gtttagggaa 

aggaagctaa 

tctgggccag 

tagatctctg 

ctatacagaa 

cctatataac 

ctcaaaagag 

gcctgggcca 

gtcaacagtc 

tgttaggggc 

caggtggaag 

tgcacgcaca 

cccacacaca 

ccatcatcca 

tcagttgatt 

tcctcttgct 

ctctataaag 

agtagaaggg 

tgttccctgg 

catcagactg 

cgggattgca 

ggactagacc 

agcagaggac 

gatgccccag 


36900 

36960 

37020 

37080 

37140 

37200 

37260 

37320 

37380 

37440 

37500 

37560 

37620 

37680 

37740 

37800 

37860 

37920 

37980 

38040 

38100 

38160 

38220 

38280 

38340 

38400 

38460 

38520 

38580 

38640 

38700 

38760 

38820 

38880 

38940 

39000 

39060 

39120 

39180 

39240 

39300 

39360 

39420 

39480 

39540 

39600 

39660 

39720 

39780 

39840 

39900 

39960 

40020 

40080 

40140 

40200 

40260 

40320 

40380 

40440 

40500 

40560 

40620 

40680 
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tgttggagga 

agaggcaaaa 

agaaaaaaga 

acccccaagc 

atacgtgcaa 

gtgcacatgt 

ccaacctaaa 

tgtaacatta 

aaggcagcca 

gctgcagggg 

ctgtaactcc 

aggtgtgcac 

catgtggatc 

ggtggtgcac 

aagaccagcc 

gtctcaaaaa 

gaggcagagg 

ggacagccag 

acagaaaact 

cttccagggg 

acagacatgc 

aactaataat 

tagcacttgg 

agtgagttcc 

ataaaaaata 

aggcgttcca 

tccctcagct 

tctgtgagtt 

aatagaaaca 

gactcagtgg 

aaaatttatt 

gggagcctag 

aattaagagc 

actgtagctg 

agccaccatg 

taagtactga 

ctgagaccaa 

ttctccagtt 

gagtagacca 

cagatgggaa 

tggtgctgtg 

tgattattgg 

tggggtttgc 

tcttcgccat 

cagtcctggt 

ctattaatta 

ggaactctac 

catgtgtgac 

ccaatgttct 

ctgatagggt 

cattctagct 

ctcctggggc 

cacagatgca 

agatcaccct 

gacaacactg 

ttgctgcgga 

gagagagtca 

ggtctcagga 

ggattcaatc 

tttttgtgac 

tgtttgcgct 

ctcatctcct 

cctcagtagc 

cattttctcc 


tgctggagcg 

gggatggggg 

tgtctctgaa 

cccacagggt 

tccatggcat 

aattccatca 

gacctacacc 

aagttagaaa 

catcaaaaca 

cctgaattct 

agctccctgg 

acatacacat 

tgggaggagc 

gcctttaatc 

tggtctacag 

aacaaaacag 

caggtggatt 

ggctacacag 

ccagcccagt 

atccaatgcc 

acataggcaa 

ttaaagaaaa 

gagacagagg 

aggacagcca 

aggccaagta 

tagggctagg 

gcatgtggta 

caaggccagc 

tcntacctag 

ctaagagcat 

taaatgttta 

gtttgagtcc 

ataagaactt 

tcttcagaca 

tagttgctgg 

gccatctcta 

gaactaaagc 

ctgatcagct 

gaggccccat 

tttcagacca 

gctgccacag 

ctatgccagg 

cctgtctgag 

gtgaggctcc 

gctggagtct 

attaattaat 

ccatcataca 

cgcatgcatc 

acttctaaat 

ccttgcctgg 

gtctggaggg 

tgcacctgaa 

tgatgggaaa 

ctactggaaa 

gcagtccaac 

accagacaac 

gagacagaga 

ctgagggtat 

tgaacacagg 

acacccctaa 

tatgtctaca 

ctgaaaccac 

aggcagcaga 

cacccttaag 


gtggggcagg 

agaaggcaga 

agtaaaaagt 

ggaaggagag 

acgcagaaac 

ttcaggagac 

caaagaagga 

tctctcttca 

gtgtaaatca 

gttctcagaa 

gatccaacac 

caataaaaat 

tgagaggcac 

ccagcacttg 

agttagttcc 

ccgggcggtg 

tctgagttcg 

agaaaccctg 

ttccagcaca 

ttcttctgac 

aacactcaca 

aaataaggaa 

caggcggatt 

gggctacaca 

attcttggaa 

aatgaggctt 

cctttaatct 

ctggtgtaca 

agcccngcca 

tgactgctct 

ttacttgtat 

ccaaatcacc 

ctttttaaag 

caccagaaga 

gaattgacct 

cagctcttat 

tgggagattg 

cccgtaccag 

ctaaacagcc 

atgtcatttc 

ttggtgtggc 

aaccagtctc 

gccatgggac 

ctggggtcac 

actgagattt 

cacgtgacaa 

tggcgactat 

agccctgact 

gcatgcacca 

catgcatgaa 

ccctcaatcc 

aaaggggact 

gccttctgga 

ggtggtttca 

agacagacga 

aaggtacgag 

cacacagaga 

agtctactgt 

aaaaaagggg 

agtcactgcc 

cagtctcggc 

tgcagcaagt 

gcagagctct 

taccctatct 


agtgggtggg 

tgggatgggg 

acttgtcact 

aaatgagtcc 

gcaaagacag 

agaagcagca 

caaactataa 

cactgtgtag 

acgagaaaac 

cctgcatcaa 

ccatttctgg 

caaaaccagg 

tgggtgaata 

ggaggcagag 

aggacagcca 

gtggcacacg 

aggccagcct 

tcttgaaata 

cacagggtga 

ctctgtgggc 

aaataaaata 

gccgggggtg 

tctgagttcg 

gagaaaccct 

gaatcccaag 

agcgagagca 

aggctctccc* 

tagctagttc 

aanaaagggg 

tccagaagtc 

tattatttaa 

aagaaaaaat 

aattcttatt 

gggcattgga 

caggacctct 

caggttgata 

aaaaatgcag 

gggtctaacc 

tgcctgcagc 

ccaggacatc 

aggatcaggg 

tcaagcagca 

tcttctgttt 

ccagccgtcc 

accattaaac 

ccccagcgtc 

gaagaacaat 

gctaaaagtg 

aacgcctccc 

gctgtggaca 

actgtgttcg 

gcttaagagg 

agcaactggc 

gtctaggttc 

tctaaacttc 

ctactcccat 

gagacagaga 

agagcatttc 

gggggggact 

tacttccctc 

tcccacttcc 

gacttgtgtt 

gtcttctcgg 

tctctgtctc 


taggtgggga 

gggttgtgga 

aagcatgagg 

cacaagttat 

acaatgagtt 

gagttgttgg 

ggaaaaaggt 

atactgtaca 

cagaaacaac 

gccaagagaa 

cctccatcag 

gatgaagggg 

caataaaaaa 

gcaggcgaat 

ggtctacaca 

cctttaatcc 

ggtctacaaa 

aataagcatt 

ctcacaacat 

accaggattg 

aatctagcaa 

gtgtcgcacg 

aggccagcct 

gtcttgaaat 

gggacactaa 

acttcgctgg 

gaagcagagg 

caggacagaa 

agacctgaga 

ntgagttcaa 

atttaaataa 

gttatcattg 

tattttatgt 

tcccattaca 

agaagagcag 

aaatttaatc 

accaccaagg 

aggcctgtgt 

agctcctctc 

aacacagcag 

gctggcattg 

gctcttctcc 

gatggtcgcc 

ctgctgcctt 

agcaacgttt 

catatgggtg 

gtgacagaaa 

gacaagcccg 

acaggaccag 

cgaggcatta 

cgctgttcca 

gctcctacca 

tgccaaaggc 

tgtgggattc 

caaggcacag 

acaacataca 

gagagtctaa 

cccagcatat 

tcgattatct 

accgccatga 

tcctcccctc 

gactgccaca 

agcttcttct 

tgcttgttga 


gcaccttcat 

ggggtaagaa 

atatgagtca 

tttctgatct 

gggtgtggtg 

aaatctaagg 

ggtcgaccaa 

aggaagagaa 

tcaagagaag 

tcaaaactgt 

catcactcac 

tagggaggtg 

tttggtgcat 

ttctgagttc 

gagaaaccct 

cagcacttgg 

gtgagttcca 

tgttgctgtt 

cataactcca 

catacagtgc 

aaaaaatttt 

cctttaatcc 

ggtctacaaa 

aaataaataa 

gtgtatataa 

tgtatgaaag 

cagaaggatt 

agggcgatat 

ccagagagat 

ttcccagcta 

ataagtaaat 

ctaataatca 

atgtaagaac 

gatggttgtg 

tctgtgctct 

tcgtggagcg 

ccctgctcat 

ctgcttccct 

tctaggtgga 

ccaaatttat 

gcacagtgct 

tatgccatgc 

ttcctcatcc 

gactccatgc 

ctctaaaata 

tggaaaatga 

atgctaacat 

aagcgaaagc 

aggtgcagct 

ttcgcaagaa 

gcaccagtgc 

agcctactgc 

tctggacaag 

caggaaatta 

ctggtagaac 

aaaaagcaga 

agagagtcag 

acaagaccct 

caaattctcc 

agtaaagagc 

tgcttctgtg 

cggaaactct 

ctcttgtcgc 

tccttggacc 


40740 

40800 

40860 

40920 

40980 

41040 

41100 

41160 

41220 

41280 

41340 

41400 

41460 

41520 

41580 

41640 

41700 

41760 

41820 

41880 

41940 

42000 

42060 

42120 

42180 

42240 

42300 

42360 

42420 

42480 

42540 

42600 

42660 

42720 

42780 

42840 

42900 

42960 

43020 

43080 

43140 

43200 

43260 

43320 

43380 

43440 

43500 

43560 

43620 

43680 

43740 

43800 

43860 

43920 

43980 

44040 

44100 

44160 

44220 

44280 

44340 

44400 

44460 

44520 
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cttttccttt 

cgcccccact 

ctctagctcc 

aaaatattca 

tgctatgact 

nnnnnnnnnn 

nnnnnnnnnn 

agagcagcca 

accaccacca 

ggttggttag 

agaaactcct 

aaatgtgaca 

acagtctgta 

cctttgcaag 

tatcagattt 

ggaaagccta 

aacagcacac 

gagtacgata 

tactgtgtag 

acagagtcat 

ttcaaggcta 

tggacatctg 

aagcatcacc 

aggtggggtg 

gtctactcca 

tctagttttc 

caggaatgca 

ctttcctttt 

tatgtgtacg 

tctgggttac 

aatcaccccc 

acagacaagg 

acctgagttc 

ttgtcctctg 

gaataaataa 

tgggcaacaa 

gaaaaaacaa 

gggataaaaa 

tgtctacaac 

cctggaggta 

ggagtactag 

aaatttagaa 

attgaggttc 

agcatattgg 

catattcaga 

aaaaattctt 

cagctacagt 

tcaggctggt 

gttcaaatcc 

cttctggagt 

aaaaaaaaaa 

gtaacccaac 

aaagagaaca 

cgagacacca 

agctgtgtcg 

tgaaatccaa 

acgccagagt 

tgcacaggag 

caattcccag 

aatgcccttt 

tgcatacata 

cgggcgtggt 

tgagttcgag 

aaaccctgtc 


ctatgaacaa 

gtggagaaaa 

atgacttaaa 

gtcaactcct 

cccagatcca 

nnnnnnnnnn 

nnnnnnnnnn 

gtgttcttaa 

ccaccaccac 

tttagtctgt 

gaaagtgaag 

tgtaaataca 

gcctgaatta 

gagaagggca 

tattgctttc 

atcttcatag 

gtgtatgagc 

gcacgtgtat 

gtctcagggg 

cctgccagcc 

actcactgac 

tgctggtagg 

aaggagttcc 

ggtgagcctt 

tgaccaaaat 

tttctttctt 

agagctctgc 

agattttatt 

tatcagtatg 

aggagtttcg 

acccccaccc 

gcctgcaaga 

actgtctgga 

ctctccccat 

aactaaaatt 

taacaccgag 

acacatttcc 

caccctgaca 

atgacctgtc 

gaagctgata 

cctaggggct 

aatgcactac 

cttcttccaa 

gattatagat 

aattaactca 

acaaccatcc 

gtacacacat 

gagatggctc 

cagcaaccac 

gtctgaagac 

aaactatgaa 

agaaaaacac 

gcagaggatg 

aaatcacaat 

ttaaatacca 

gctactgggt 

taactcagag 

agatggctca 

cacccacgtg 

tcacagtacc 

cacaggcaaa 

ggcgcatgct 

gtcagcatgg 

ttgaagaaaa 


aatatctcct 
cccagggcct 
gcatctctgt 
tgccattcaa 
tccacttggt 
nnnnnnnnnn 
nnnnnnnngg 
ctgctgagcc 
caccccacca 
gagacaggag 
gcctacaaaa 
tgctaaaata 
cagacacgac 
gaagcttcca 
tctttctgtt 
cccatgtatg 
accacagcat 
agagttcaga 
tggaacttgg 
cagctttctc 
tctgagttat 
aacaccatcc 
accacctcaa 
gtgatttatc 
ggtgacggtg 
ttttcttttt 
caagtgagct 
tcgatttatt 
ccatgggtat 
agctgt cctg 
agccttcgag 
tggctcagca 
acccgcgtaa 
gtgtgccatg 
agcttagtaa 
agtagaaagg 
tgtacatcgt 
aaaattaacc 
acccagggaa 
cagaagccat 
gtactgccca 
aggtttgcac 
aaggacttta 
attctcataa 
atgtgaacca 
ataacaaaaa 
ataaataaat 
agtgggtaag 
atggtggctc 
agctacagtg 
gaactatgaa 
ttgcctggcc 
agggcatcct 
gataaaagag 
cgcaggaagt 
ggactgaggg 
agcagcttgg 
gtggttaaga 
ggcacttaca 
aggcatgtac 
acttacataa 
tttaatccca 
tctacagagt 
taaaaaaaaa 


taaaggatct 

tgcacatgct 

gctgtcaaat 

aatggatgac 

agcccaggaa 

nnnnnnnnnn 

ttactgggat 

atctctccag 

ccaccccgcc 

ctgtcccttt 

cagccacact 

attcgttaag 

acgagccatg 

tccttgctag 

tttctttatc 

tgagcacttc 

aggtgtgtga 

ggagaactga 

gctcagcctt 

tttttctctc 

cagaactgct 

ccactcggct 

ccagcaaata 

gttaggatct 

gctgaccaaa 

cttttacccc 

acattcccag 

tgtctatgtt 

acagaaacct 

tgggttctca 

ttcaattctc 

ggtaacgaag 

agggggaagg 

gaatgaacag 

cttttatgtt 

caagcatgta 

tctgggagtc 

tggggaggaa 

gtcagggcag 

ggagctgctt 

cagtgggcta 

acaggccaat 

gcttgcatta 

aaaaaagaca 

gaagctctga 

tctgatgccc 

aaataaatat 

agcacccgac 

agaaccatcc 

tacttacata 

ctacaagaag 

tgcgtttggt 

gacttgagtc 

atcaacaagt 

gaagaggagc 

aggacaacag 

gtagctttaa 

gcattactcc 

atcatccata 

acagtgcaat 

aatactaagc 

gcacttggga 

gagttccagg 

aaagaaaaaa 


cttctagttc 

cagcaggagc 

atacacttcc 

ctcaaagcca 

tgaactcann 

nnnnnnnnnn 

ttgaactcag 

ccaccaccac 

ccacctgctc 

tctatagtgg 

tatttgttgg 

tcagtgaaca 

acagaggctg 

caaatctttg 

atatttgttt 

agcatatgtg 

gtaccacagc 

gagagtccgt 

ggtggcaagc 

tgttatgtct 

tgtgagagca 

tggatgacga 

tttacctcct 

catgggagtg 

aagaaacagc 

acggtactaa 

atctgttttt 

tgtgtgtatg 

gagaaggcca 

aatgcagcag 

atcatcacaa 

ctcgtgtcat 

gaagaatcaa 

ccctcccaaa 

gaaagtggtt 

tgtcactgaa 

tgagttaggg 

gctgtttatt 

aaattcaaac 

ctggcttgct 

ggctctccca 

ctggtagggc 

tgttgacata 

tttagattcc 

aagtaagagt 

tcttctggag 

ttaaaaaaat 

tgctctttcg 

gtaataagat 

taataaataa 

tcaggaatag 

ctctagcacc 

aagtgacaag 

tgggctttat 

tgagtctggt 

ctagctcaag 

tgagactctg 

tcttgcaaag 

actttagttt 

tacatacata 

agataaatct 

ggcagaggca 

acagccagga 

atcttaaaag 


agggtccccc 
tccatccagt 
agcccttacc 
gagtcagcgg 
nnnnnnnnnn 
nnnnnnnnnn 
gacctttgga 
caccaccacc 
attcctgatt 
aaggtgaata 
aaaatactgt 
accttaaaac 
aaataagacg 
ttccaagctt 
atttgttggg 
tgtgaacacc 
acatgtgtgt 
cttttcctcc 
tcctttatcc 
atccactctg 
ggagtaactt 
aggggaaaaa 
atacatggat 
attacagctg 
tacacctggc 
ggattgaacc 
ccatttcttt 
tgtgtatgct 
gaagagagtg 
caccaccacc 
aaacacacac 
aagcctgaga 
ctctatgatg 
cacacatcaa 
tttacatgcg 
cagcattgaa 
tttctatgct 
tcagctttta 
aggtcagaag 
ctagcacaca 
cagcaatcat 
cattttctca 
aaaactag cc 
cacataacac 
taaaactatg 
tgtctgaaga 
atatgaaaaa 
aaagtccaga 
ctgactccct 
ataaatctta 
ggctgggggt 
accaacgtag 
tgataatcct 
ctgaataaag 
aacacaggcc 
gcccacctgg 
cccaggccag 
gacctgagtt 
caggggatcc 
catgcatgca 
taaaagaagc 
ggtgtatttc 
ctacacagag 
aaaaggagag 


44580 

44640 

44700 

44760 

44820 

44880 

44940 

45000 

45060 

45120 

45180 

45240 

45300 

45360 

45420 

45480 

45540 

45600 

45660 

45720 

45780 

45840 

45900 

45960 

46020 

46080 

46140 

46200 

46260 

46320 

46380 

46440 

46500 

46560 

46620 

46680 

46740 

46800 

46860 

46920 

46980 

47040 

47100 

47160 

47220 

47280 

47340 

47400 

47460 

47520 

47580 

47640 

47700 

47760 

47820 

47880 

47940 

48000 

48060 

48120 

48180 

48240 

48300 

48360 
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gactggagag 

aggacagcca 

aacaacaaca 

catgatggtt 

gcatatatgt 

aggaggctgg 

acatgaaaaa 

tgtgaatcat 

cactccccag 

tgggaatgta 

gattgtggtg 

ggttcagtga 

ttggaatgct 

ttgtcttaac 

cttgtgaaaa 

acataggatt 

agagctggat 

gttgagaaca 

ggttttaatt 

atgtctgtgt 

cccggaagta 

aaagagccac 

tccagaggtt 

ggatccgatg 

tcaaaaaaga 

ctgtccctta 

tacacgtgct 

gcacacaaga 

aatcgggagc 

atgaggatgt 

aattgtacac 

acagacatgt 

agccaggtgg 

gaggctggag 

tcagttccca 

tctgatgtgt 

ccatgataac 

aaggagtaat 

tgttttttct 

tgtcctatgc 

ctttccacag 

agttctgagc 

ttctttcttt 

tttctttgta 

tctgcctgcc 

ctgggctcaa 

attactttgt 

ggctataata 

agtcctgctc 

acacagaaca 

ggaaagagaa 

nnnnnnnnnn 

cagctactga 

ctcatcttgg 

cccagcagcc 

gctcctgctc 

tctcaattag 

cacccccacc 

cttacccttc 

attgccacac 

tctcaccatt 

ccagcccggt 

cagctgtcag 

attagaaacc 


atggctccac 

ggactataca 

acaaaaccac 

cacaaccatc 

ggtgtacata 

gtttgtagcg 

aaaaagacat 

aagctctgaa 

cactgcaaaa 

actcagccga 

gtgcatacct 

catccttggt 

gtctcaataa 

ctctgacctc 

cctaccccag 

tttatactag 

atggtggtgt 

gctagactat 

tttttctttg 

atcacttgtg 

ttattacaga 

ctcgggctgg 

tggagatcaa 

ccctcttctg 

aaaaagaaac 

atgttaataa 

aaagcaagtg 

cttttctgct 

tgaggaagat 

tctagaactc 

tttaaatgaa 

gtgtgtgtgt 

ttgtggctca 

agatggctca 

gtaaccacat 

gtctggaaac 

attgaagcta 

gttagctttt 

tacccctctt 

agcccaaatg 

gttatgttag 

tcaattcctt 

ctttctttct 

taccctggct 

tctgcctccc 

ttcttaacat 

ttctgatctt 

tagactctcc 

ctatattttt 

tttttttttc 

atgagagaan 

nnnnnnnnnn 

ttcctcctcc 

acttctgcgc 

ccccagttct 

tgattggtgt 

ggaccctcac 

ctatacgcac 

taatcctccc 

ggacacacag 

cacaagcaca 

gccaagtatc 

cctggccccg 

cgctagctcc 


agfctaagaac 

gagaaaccct 

ttgttcttac 

agttgcaggg 

catgtataca 

cagaggtaga 

ttagattcct 

agtaagaata 

gatcatgaaa 

tgcagtgctc 

ataagctcac 

cacatagaga 

ataaaagaaa 

tgtgttcatc 

agtctctgtt 

aaaaataaaa 

ctagttatag 

acggtctcaa 

tgtttgttac 

tgcctgaaac 

aggttgtgag 

agagatggct 

atcccagcaa 

gtgtgtctga 

agccacctct 

cacaggtata 

tgaaccttaa 

caagagtatc 

atggggagtt 

aaaaatggtg 

tgcaataaaa 

gttaaaacat 

aaaaataatg 

gcggttaaga 

ggtggttcac 

agctacagtg 

gacttgctac 

tgtcctaaat 

ttactttgct 

ccatcttcct 

gagactggag 

ttctttcttt 

ttctttcttt 

gtcctggaac 

aagtgctggg 

tgtggagaga 

ttgctgcagt 

ttgcaatact 

tcctaagctt 

ttgtctatag 

nnnnnnnnnn 

nnnnnnnnnn 

tccctcctcc 

atgtcctcag 

cttcctgaaa 

cacctgctgg 

cctctgatat 

atacacactt 

caccggacac 

gagatctcat 

ccacttctgc 

cccaactaca 

gctggcgggc 

ctaattgcca 


acttgttctg 

gtttcgaaaa 

agaggacttt 

atccaaggtc 

ctcatataca 

ggtaaaaaga 

gcatcacacc 

agcctagtat 

ccagaaatgc 

acctgtcgta 

tacgtggaaa 

atttgaggcc 

gaaagaaaaa 

aagtctcctc 

cagaggaccc 

tgaataagta 

catccagaac 

caaacaaaag 

ctaacgtgta 

ccaaggaagc 

cagccatgtg 

cagtggttaa 

ctacatggtg 

agacagctaa 

ccactctccc 

tgatactatg 

atacattatg- 

tgtatgaagt 

ctaatggcta 

ataactcagc 

cttgtctcag 

ttcttggcat 

ataataacaa 

acactgactg 

agacatctgt 

aaagtcattg 

tactgctgag 

ttgtttcctt 

ttcccctctc 

gcctcagcct 

tctgctcagt 

ctttctctct 

ctttctttct 

tcactttgta 

attaaaggtg 

aaagtattgt 

caaatccttc 

tggaaatgct 

ccttctaagg 

catgcgttag 

nnnnnnnnnn 

nnnnnnnnnn 

ttcctccctc 

cccagacctt 

cttctgaggt 

ataggcttgc 

accacacatt 

agctgcatca 

tgcccaggga 

caaggagatg 

ctccagcttc 

tccccaaaat 

gcctgctcct 

gttctgagct 


aggtctacag 

accaaaacca 

ggtttgattc 

ctgtcttctg 

taaaataaaa 

ctctagcttg 

atatccaaaa 

gcactgtaag 

agatcctctg 

tacagagcac 

gtagaggcag 

agtctggtct 

gaaaagaaga 

ctcaggaact 

aggctccagc 

tgtatttttt 

tgagacagga 

taagggatct 

tggttgtttt 

cagaggaggg 

ggtgctggga 

gagcactcaa 

gctcacaacc 

agtgtactca 

tttttaaaat 

ccttgtttat 

ctgagtaaaa 

attgaacatg 

caacatttct 

atatatacta 

taatgtggtt 

ggcaataaaa 

taataaaaat 

ctcttccaga 

aatgggatct 

caaggacttt 

tgtgtctgct 

cctttcctct 

atctcctctc 

ccccagtgtt 

cggggaggga 

ttctttcttt 

ttctttcttt 

gctggcctgg 

tgcaccacca 

agctgttctg 

tcatccatct 

ctacagtcag 

tctttattgg 

agtgatcgtt 

nnnnnnnnnn 

nnnnnnnnnn 

ctccccagcc 

ctgctcttgc 

actctccatc 

tcctgactcc 

tccctagtgt 

ggatcctaca 

cactggggct 

tgcctacccc 

tactctctcg 

tctcagacac 

ggcatagcga 

gtccttgtta 


agtgagttcc 

aaacaacaac 

tcagaatcca 

tgggcaccag 

agttttaaaa 

tttaatgttg 

attaactcaa 

gctctgggtt 

aaccacagca 

aggataaatt 

gacgaccaaa 

gctggtctat 

agtcctatga 

cactggtcat 

tgtggttacc 

aaaaaggtgc 

tagccatgag 

gagtagatga 

gaatacatgc 

catcgggtcc 

atcaaatctg 

tggctgctct 

atatgtaatg 

aataaataga 

cctcttgcct 

gaatagaaaa 

ggagtgagtt 

tgaactctga 

ttttggaatg 

aaactcattg 

tagaagatgt 

atacagtttt 

aatgaaaaca 

ggtcctgagt 

gatgccctct 

acaatagtga 

ggctctttct 

ctccctctgc 

ttaacagagt 

gaaaaatact 

gcctgggtca 

ctttctctct 

taagacaggg 

aattcagaaa 

ctgcccagcc 

gccacctgga 

ttcctcgtca 

ctacatcctc 

tttatgattt 

gccagataga 

nnnnnnnnnn 

nnnnnnnnnt 

tcatgctctg 

ttctcctctc 

acctcctttg 

actgttcgtg 

ctccacctcc 

ccagggactt 

ccagagggct 

agagggtagc 

caggaagtag 

tgccagcctc 

ctagggtgta 

ccggctgccc 


48420 

48480 

48540 

48600 

48660 

48720 

48780 

48840 

48900 

48960 

49020 

49080 

49140 

49200 

49260 

49320 
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49440 

49500 

49560 

49620 

49680 

49740 

49800 
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49920 

49980 

50040 
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50160 
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50400 
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50880 

50940 
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51300 
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gaggcacaca 

tagacctaga 

gcaatctgcc 

ccagcccggc 

aaagagtctt 

ggaggacaag 

agaggggtga 

actcttgtag 

tgggagacca 

tggcctctcc 

ctacctctag 

ccttgggaag 

gcaagagaac 

tggttagaaa 

gcccagggcc 

tgcaagtttc 

gcctatgagc 

gttagggttc 

gaagaggaag 

ggtagctgcg 

gctagacacc 

ctgcaggcct 

tgtctagcat 

cctcagtcta 

ggatctgcgc 

gaagcagggc 

taggggcacc 

ctgctgttgc 

atggtacaga 

agtgggcatc 

ataagaaaag 

aaaattactt 

taactggtga 

gataatgctt 

gtgtgtttgc 

ttgctctcca 

aagacaggct 

gcctcgaact 

tgtaactaca 

ttgaaatgac 

ccaaaggtcc 

gatctgactc 

taaattaatc 

ctgccccaag 

catgctaatg 

ttcttttaag 

gtaccgtgtc 

gttgtgagct 

taagtactct 

tctaaaagta 

gttgaacaaa 

acagactttg 

tcagcaagct 

aggtggggct 

ttgcccctcg 

ctctctctct 

ctgcttcaac 

cctttttttt 

aggctcaggc 

ggatggaaag 

cactcatgtc 

ctaagtggag 

cccctccaca 

actaaccaac 


tagaggaaaa 

gaggactggc 

ccgtgggtgc 

ccattcattc 

gggagctaaa 

agagagaaga 

cttgggtgac 

ggtaggtcag 

acacttgtct 

agcctctgct 

ccaaaactgt 

aaatgaagat 

agcttctaca 

gagtcacagg 

cctcctcact 

aagtcaggac 

ctgttgtcag 

ctcacactgc 

ggtcggttca 

gctgcctcca 

accagattct 

gcccagaaca 

cctggctaaa 

gacaacactt 

taggtacagg 

cacctctggc 

ctatccccaa 

agatggagtt 

gtatcccagg 

acgaccacaa 

gagagaacat 

ctgtactttt 

agtggaataa 

ttacacctct 

tcatgggtat 

cttttttttt 

ttctctgtgt 

tgcagagacc 

catagctccc 

tgtttggggc 

tgagttcaaa 

cctcttctgg 

ttttttaaaa 

tgctgggatt 

tatcccaggc 

gtttattttt 

cttgcagtac 

gcatggtggg 

taactgctga 

ctagggatgg 

gggctttgtg 

actcttctag 

gtacaaacaa 

actggctttc 

tctttcaccc 

ttaaacagga 

cttctgagtg 

tttttttttt 

ctccctgggt 

atggagccag 

cactgggaga 

ctgagcaggc 

tggctcctcc 

agaatggatg 


ggctgagagc 

atgtggccag 

ctcccagcca 

tttattcagg 

gggtcccttc 

aggaggaaga 

aggacacagg 

cctccaacca 

ctccttttcc 

tacaaatgtg 

gtccacaagt 

tccccatgcc 

ctggtgatgg 

agctgagagg 

tgggggagct 

caaaggtctt 

gggaatctcg 

ccacaaagag 

cagtgaagga 

caccagcctc 

gctcagagat 

gcagatgact 

gggaacactt 

ctagaaacct 

acagaaagtc 

tcaggcacac 

gctccttacc 

taggcagcca 

tcaggttggc 

agctcatgtt 

gaggaccaga 

gagacaagag 

taccacttac 

ctgtgtgcac 

atggagtcag 

tttttttttg 

agccctggct 

cacctgcctc 

tcttttttgg 

tggagagatg 

tcccagcaac 

agtgtctgaa 

agagaaagaa 

acagagctgt 

tgtcctcaga 

atcttatgtg 

cctcacagtc 

tgctgggaat 

gccacttctc 

cggatgtgcg 

catgccaggc 

tagtagtgtc 

gctatgggcc 

aaaatgaccc 

tctcccagag 

tctggttatg 

c tgggattat 

tttaaagaat 

accagcataa 

aggatctttc 

tactgtagta 

tcgtatcctc 

ctgcagaact 

agcatgtgtg 


tgagccaggc 

ggaccaaacg 

catccatttg 

tggcataaaa 

cttgcctcag 

ctcctggcag 

ttggggacct 

tgaagtataa 

cacccagggt 

gagggaccct 

gccagcccac 

tgccttcctc 

tccttccggt 

ctgatccagg 

gaactggggg 

gcctccatgg 

ctgttcctgg 

gggcacgcct 

ggagagggac 

gctgagctcc 

cccacgaagg 

ggtcagtgct 

gaacccagcg 

tttttttttt 

tagaggagca 

tggcagctcc 

cagttggctg- 

caccttggtg 

tagtacctcg 

gttcttaaag 

cagcacctgg 

ccagaaattc 

ccctttgcaa 

acataagcat 

aagtaggtaa 

gtgttgccat 

gtcctggaac 

tgcctcctga 

acacagggtc 

gctcagcggc 

cacatggtgg 

gacagctaga 

atgatggcta 

acaacaagcc 

ctctctatgt 

tatgggtatt 

cagaggaggg 

caaaccctgg 

caccttgagc 

ttcatgtgcc 

aagcactcaa 

tccactacag 

ttcctgtcct 

atagaggagc 

agtcagctat 

tggccctaac 

ggtgtaagcc 

ccatgcagtt 

tgcctgcagg 

ctctctgaac 

ttcaaggaag 

tcaccacggg 

ggcaatgctg 

gtgccacaca 


tggcatggag 

tggcacagag 

cccagaactg 

atcactacaa 

tccccaagat 

tgttggcatc 

gaatgtcttc 

caccaaggcc 

gtctggaata 

aagttaggga 

aaaagatcac 

caggccccac 

cccaccctat 

tggggactca 

tagtcttcct 

aagtcagctt 

agctggggca 

atggtgtcct 

attcgattca 

atggtagact 

tctgggccct 

gccccaaggc 

gttgattgga 

tttttttttt 

tatcaaatgc 

cgtactctgc 

agggtggcca 

ggcctctcct 

gacacgttcc 

gggaaatgag 

acctgtctgg 

agggttagca 

ggtgacatgg 

atatgtttgt 

acatcagtcg 

ctttttgttg 

tcactctgta 

gtgctgggat 

tcatggatcc 

taagaacact 

ctcacaacca 

gtgtacttac 

catacttctc 

caagtttgtt 

aattcagaac 

ttgcctgagc 

caccatttcc 

tcctctgcaa 

ttttcttcct 

tggtttatgt 

caactgagct 

cctgagttct 

tgcctctcag 

cacagcaaac 

aattcgagtt 

tatcttcaac 

accacactca 

aggacagcat 

cgggtcctct 

ctcaatgtcc 

aagcaacagg 

ctacagagaa 

ggcccggctt 

cctgggaccc 


gtagccctag 

agggctcagt 

tgacgtcaaa 

aaactttaca 

tcctggcagg 

tccaaatacc 

agcaagggac 

agtctaagct 

tgtctaaaga 

cttgcctaac 

cccctgagcc 

cccacctgct 

cccacaaagc 

ggatgctgct 

ccatgcgggg 

tgtcattctg 

gcgcgctggg 

ccatgatgaa 

tggctacgct 

gatgttgcac 

ggaacaattc 

tatgtggatc 

atctgttaga 

ttttaaatca 

tcccatccag 

ccagaccacc 

ccaggtccag 

gcagcgaggg 

actcaaaata 

ccacctacag 

agtctgggcc 

tgctttcact 

gaccaaatga 

atcggtgtga 

tcttcctaca 

ttgttatttc 

aatcaggctg 

ctaagatgtg 

caagctggct 

gactgctctt 

tccgtaacaa 

atgtaataaa 

tctcgtctct 

gtgttttaga 

gaccttgaac 

atttgtctgt 

ccctgaactg 

gagaagccag 

cctatctcga 

gttgctaagg 

acacatcccg 

ctatctgctg 

ttctctccgc 

agtaggaagc 

tttttttcct 

ttcagtcttc 

gctcacacaa 

ggaaatgacc 

gccagtgggg 

cacagtgaga 

aaggtgagag 

gtctggctgc 

gcccagtcaa 

cagcactcag 


52260 

52320 

52380 

52440 

52500 

52560 

52620 

52680 

52740 

52800 

52860 

52920 

52980 

53040 

53100 

53160 

53220 

53280 

53340 

53400 

53460 

53520 

53580 

53640 

53700 

53760 

53820 

53880 

53940 

54000 

54060 

54120 

54180 

54240 

54300 

54360 

54420 

54480 

54540 

54600 

54660 

54720 

54780 

54840 

54900 

54960 

55020 

55080 

55140 

55200 

55260 

55320 

55380 

55440 

55500 

55560 

55620 

55680 

55740 

55800 

55860 

55920 

55980 

56040 
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acagctgggg 

gaacaaacaa 

tactctagcc 

tggcacggag 

atctatgaag 

tgggctccgc 

atattttctt 

aaacccttca 

ttgccagcca 

tcgaagagga 

gtggaagaaa 

caggcagagg 

ctatnnnnnn 

nnnnnnnnnn 

ggaaaaagga 

ctaatgattg 

atataaaaac 

tttctgtctc 

aagaggaaat 

acagaaacta 

acagggtttc 

tcgaactcag 

gcgccaccat 

tttaaaaaaa 

tgcccttgcc 

tcagcactca 

tacataacaa 

aggggccact 

aactgtcctc 

tttgtaattt 

tatgaacatt 

caattacgtg 

gtctattcgt 

ctcccagttg 

aaggattaca 

atctttgcat 

agggagtagg 

tccctgtggg 

aggctctaag 

ctattcactc 

tgagtccttc 

caatccaagg 

taactgcagc 

taatgcgaaa 

gagacattca 

agaatctata 

gagaaaattt 

gtcaactttt 

gctccttcct 

gtaggctttg 

gaaagatgtt 

tgaattgtat 

cagagagaca 

ggttcgtggt 

ggttagttag 

ttttttacaa 

ggaaatatta 

gtagccgatc 

tcgtaggtat 

gagtggggat 

ctcctctggt 

ggaggatctc 

attttgttta 

cttgccagcg 


cagaagggtc 

aactagcccg 

aaggagtcac 

cttcatccaa 

cactgtgtgg 

taaaacccca 

tgaggagatg 

ggagctacct 

aaggcaaagt 

tatgacaccg 

tctttctggg 

gcaggagcca 

nnnnnnnnnn 

nnnnnnnnnn 

atcttaaatt 

atgtggaaag 

acaggctgaa 

aggttcctac 

aaaccctttc 

taactaatat 

tctgtatggc 

aaatccacct 

tgcctggctg 

aaaatggcca 

ataaattatg 

tgagcctgag 

gatcctgtcc 

gcaaagacag 

ctctctgtaa 

taattagttt 

ttcaaggcac 

ccaccagcaa 

agttctgatc 

agtgttctca 

gtgttaacta 

aaattctttt 

tctggccact 

aagggcacat 

tggtcctgag 

ttctcatttt 

tattgttttt 

tacacttttg 

tgatatggta 

tattgagtaa 

actctgacac 

aataactcac 

tagcaactgt 

aattatgcag 

gctgcctacc 

gaggggcgga 

aatgcagttt 

ttacatgccc 

acatagttct 

gtttcctttg 

gattcctggc 

aaccgtttgc 

tttccctcct 

ttacatgtat 

gcagttagaa 

ggccttttgt 

ctatcactat 

tgcttctgaa 

aagagcaggt 

ccttgaccac 


atgagtccaa 

tcctgttatc 

ggttaggcta 

tcagaatgtt 

gaactcctct 

gcacagagaa 

aacctcaaag 

acccagggac 

ccagactgac 

cgtgcatcat 

tgaggctcgg 

tgtaacataa 

nnnnnnnnnn 

nnnnnnnnnn 

agaagatgcc 

tcacgaggga 

caaaccacaa 

cttgacttcc 

ctccccacat 

agttggtttt 

cctggctgtc 

gcctctgcct 

gttttctttt 

tggcatataa 

taaaaatttc 

gcagaggcag 

aaataataac 

tatgacaaaa 

ttttcagttt 

tttaataata 

ttgatacata 

tgttagcttc 

ttgttatatc 

ttgaatttca 

ggatagactt 

taatcaaagt 

gatgagtgtc 

ctgtgttgta 

ctagctaact 

gcagagcact 

cttttactta 

ctaaagaatc 

catacatatt 

tttttaaaaa 

cagcacccta 

gaagctgaca 

tctaatagag 

acgctgatta 

tttaaccgtt 

caggcactca 

gtagaattat 

tgtccctgtc 

tatatttaat 

ttcattacat 

acgcggacag 

ctttaagagt 

gcttaaaccg 

catacctatt 

gcctctagtt 

gtgccccaag 

cctgccactg 

cttctttggc 

aggctgttag 

tgttaacttt 


agcgaacttg 

tcagccacag 

gaagcaaggg 

cagagcaata 

cttcccttac 

cagttacctg 

aagctgtgtc 

cagactttac 

ctgtatctca 

gtccaccgac 

gtcaaacttg 

aatcagcctn 

nnnnnnnnnn 

nnnnnnnnnn 

ttgataagat 

tggtgtcacc 

agcagtagtc 

ctcagtgaag 

agtttttggt 

ctttttttgt 

ctggaactca 

ctgcctccca 

ttttttaata 

tataaaaaga 

catatggaca 

gaggatggtg 

aacagtaata 

ccactggccc 

ctaattttta 

gaaacaagct 

cctcagattt 

cttttttccc 

cctttttatt 

ttagcagctg 

tgacaaagaa 

tcctcaaaag 

caggccacag 

acaggattcc 

gcccttggct 

taccaggtag 

aatcccattt 

actcattttt 

ctctcttcct 

tcaatactca 

ctcagttcct 

ttactcactg 

cctgccagtt 

ttcagacgac 

ctcagtgctc 

aaccagctaa 

tgactaaaat 

ccccatagca 

tttttccttt 

ttgtatccag 

tgacaccaca 

cagtgttttg 

aagctgtgat 

tctggcatat 

aaatgaaatg 

gctgttgtgt 

ccagaaaacc 

ctgagaaact 

agcttgggtc 

tgttaaccaa 


tgtaacattg 

atgatgggcc 

aagccttagc 

agctttgaaa 

gagcagggcc 

gcacgtgaca 

ctggatagac 

ctcccagtac 

ggttgctcca 

actgtgaacc 

gtcctccaga 

ctgcctgtct 

nnnnnnnnnn 

nnnnaaccaa 

tggcatgtag 

ctgggcagat 

ctcaatggct 

gcatgtcaca 

tatgatgtta 

ttgttttgtt 

ctttgtagac 

agtggtggga 

catttataat 

agtgcttaca 

tcagtctcaa 

agctcaaggc 

atttcataca 

tgcctaattg 

cataactacc 

aagtgctaag 

gccctccagg 

taccatctga 

gtttccctgg 

tttcattaat 

ctatgagaac 

cctctctctg 

tacaggtgtg 

tgtcttaaca 

ttcccttgat 

ctatgtcctg 

gaaatgcgcc 

atatgcaaaa 

attatccact 

attttttaga 

gagccttcct 

tgttgcagtc 

atcagtagtt 

aaattgttgg 

attagcacat 

gcacttagag 

tgagtcattt 

acagataata 

gtcgaacatt 

actagttact 

gttgtctgat 

cacatttcac 

cataatttaa 

gtttgtctat 

ttgcgtgtgt 

ttcacacagt 

ctgctgtgtg 

ccataaccaa 

ttacatgtct 

ctcatctttt 


tcagaccctc 

caaggatcag 

tgagacagct 

cccgacttcc 

ctggtcctct 

aaaactcaat 

acagcataat 

caggcctcgt 

gcaggaacca 

gctcatccag 

aacctgcagc 

tgcctagaac 

nnnnnnnnnn 

ggcaggtctt 

gtatgtctca 

ggcctggggt 

tctgctttag 

tgagagttgt 

tatgtcaaca 

ttgttttgag 

caggctggcc 

ttaaaggcat 

gcattttaga 

aatcaccatg 

gcttacaatc 

cagcttagtc 

tagaactaga 

tattttaaat 

atgtattctt 

aatattttca 

tgagcagtac 

ttctgtttca 

gttccaacac 

ggcacagaag 

atatcttatt 

ttcccatctc 

cgtggttctg 

agccttgctc 

taccagataa 

gaagtacgaa 

agggacactt 

tgtcacctat 

aataggtgac 

aataattaga 

ctgccggagg 

atttttttct 

gagaatgcaa 

tgcctgcacg 

gttccagaag 

aagctctgat 

ggattccctg 

ggattgtctg 

ttcacatgat 

tctgataagc 

cgtttcccac 

ccagattatt 

gcctttctag 

tacaaagacc 

gatgaacctg 

tgttttctgc 

ttccccgcgt 

atcagttagc 

cccaggtcca 

gctgcctgtt 
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56700 

56760 

56820 

56880 

56940 

57000 

57060 

57120 

57180 

57240 

57300 

57360 

57420 

57480 

57540 

57600 

57660 

57720 

57780 

57840 

57900 

57960 

58020 

58080 

58140 

58200 

58260 

58320 

58380 

58440 

58500 

58560 

58620 

58680 

58740 

58800 

58860 

58920 

58980 

59040 

59100 

59160 

59220 

59280 

59340 

59400 

59460 

59520 

59580 

59640 

59700 

59760 

59820 

59880 
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ttttgggggg 

ctctcttccc 

ctttgctttt 

agccgtgcac 

atcactgcct 

acaaaagagg 

ttctctgaga 

cctgagactg 

ggggttttgc 

tcgttctagt 

gggagataat 

aaagcacccc 

tttgagtggc 

tggggaccag 

agccatgact 

acaaggacgg 

aacgagagca 

gtcagcctcc 

gccaccgtcc 

gagccagcct 

ggacaggaat 

aagggcaatg 

tgagggcagt 

ggagcacttt 

cccgtgtttt 

ggctagtgtc 

gactaggcca 

ggtaatctct 

ctgagttctc 

gcagctgcag 

gtaaatctct 

accattcctg 

acaggttgta 

ggaccaaaaa 

aaagttgtgc 

gaagaaggat 

tccaaggagc 

gttcaattcc 

atccgaccca 

catgagagca 

cattcatgcc 

agtaaatgga 

ccagcatgtt 

tgccctctgc 

gtgctccctc 

aacctagagt 

ttctcaaatg 

agatgattat 

atctagaatt 

nnnnnnnnnn 

nnnnnnnnnn 

tgccttaaga 

ggagttctgg 

gtgtgtatgc 

tgtaggtcct 

tacaggcccc 

ctcacacctg 

aagccagcct 

cccgttttaa 

agcttcatta 

tttattactt 

acagtggctg 

tgtctgtcat 

caatgtacat 


tttttttggt 

agcctctcaa 

tttaaaatag 

atggtatgag 

agaggcccct 

ttgtctgagg 

tttcagagca 

cagagcagtt 

acacacttaa 

agccaaccaa 

cttaggagtg 

actggtaatc 

acacagacat 

agtttgcgat 

cagatgggct 

gacaaatgtc 

tcattaccaa 

tgaaccactc 

gggcagcaca 

ttctttccac 

cagcagatat 

catccttggg 

gagccatatc 

cttggcttct 

gttgtttttt 

tatcactgag 

gcccctcggt 

ctccagggcc 

agtacttctt 

ctcccagatg 

gctgttccca 

cttcccttct 

tttcctaata 

aaaaaaaaaa 

atcaggccta 

ctcaagtcgg 

tatttcagag 

atctctctgt 

catagttttc 

gacccttggg 

ccaatgcaaa 

attggctgtg 

ttgttttgtt 

tacctctccc 

gatcctgctc 

tcacttcaga 

cagtgatatt 

cctttgagcc 

tattctaaag 

nnnnnnnnnn 

nnnnnnnnnn 

gttctttgtt 

ctgttgtttt 

gcgcgtgctc 

ggggatcaga 

ccatggacaa 

tgaccctacc 

ggactacaca 

aacactgggc 

acggcctagt 

tttctctgga 

ctgccaaagg 

ctaattctag 

aaaataataa 


tttgtttaag 

atgttagaat 

agttttgcca 

gaactcacat 

ttcttccttc 

cataagacta 

gtgactacct 

tccaaaaatt 

ggtgcgtaag 

agagcagttg 

ctatgcttct 

cctttttgat 

aatggttagg 

gagtcatgtt 

tcaggcccag 

ctgacgcaca 

cagcaggcag 

ctgcaggccc 

gaggatcgca 

taccctgctg 

gggcccagtg 

ccaataaaaa 

cctccatgcc 

ttcttcctgc 

ctgagatgat 

ccaggacccc 

agagtaggca 

tcacgagagc 

tgactgccgc 

ggaagccttc 

ggggcagggc 

tgtaactcct 

ttttccctat 

aaaaaagccc 

gcacttgggt 

aggccagtct 

atggtttaat 

gctggcctca 

cctcagcctt 

tgctactgcc 

atggtctcca 

ttgaaaattg 

ttgtgtttta 

aggtgtgcca 

actcttgaga 

gcccaggctg 

tacaggccta 

atatctctct 

ctgactggcc 

nnnnnnnnnn 

nnnnnnnnnn 

tatgggaatt 

ttaatcttta 

aacttgcaaa 

ctctgttagg 

ctttttcttt 

accactcatg 

gtgagtaaag 

aaggtgaaac 

tatctttaaa 

acatttaaaa 

actggagttc 

gggatctgac 

ttaaaaaaat 


ccaagatcag 

tacaagcatg 

tagcaacaga 

atgtgtgaat 

cttgcagttg 

ccttcaataa 

tcaataaaac 

ttagacaaag 

taaataaact 

aaccaaagca 

gcgtcctaca 

tatttttttt 

cttcggtgtg 

ccatctgatt 

ctgctcagtt 

tttccttctg 

atacgccttg 

aacaccagtg 

aaaggacagg 

tgcctcacac 

cctctactca 

aggtcttctg 

ttgtagacag 

cccagcagct 

agggcagagt 

aacccagcac 

ttgactctct 

cggatgggcg 

tgtggggcct 

tgttgtaaga 

tccagctaaa 

ccctacatgt 

aagtgacaca 

tagaaattta 

actggtaacc 

caagtgacac 

ctggggaccc 

tcagacacac 

tgacccagag 

aggtaatggt 

caccagagca 

taaagccaag 

ggcagcgtct 

ccatgctggg 

cagtcttcct 

gcctcaaact 

cacctggctt 

tctgcttctt 

tcagaattgc 

nnnnnnnnnn 

nnnnnnnaca 

tatgggaatg 

ttcatcttgc 

attgcaaaat 

cttggtggta 

gaaaacctgt 

aggaagaggt 

gctatctata 

aaaagtcaat 

actgtatgca 

gtactttagg 

actcccaagc 

accctcacag 

gaaataaaat 


ttatatggcc 

catccctcag 

aatctaacct 

ggaagttcat 

tcgtgctagc 

aacatgcaca 

atggacagac 

ggtaggatga 

gagctacact 

cctagacttc 

agtattatga 

ataaattcta 

tgctcattca 

tctgtcggat 

catcttctgg 

ttcttgcact 

ccacaggcat 

tgtccttcac 

agattctgca 

ctcacatgaa 

tcctctgagt 

gctgtaataa 

cctatcctga 

tggaaacgta 

acccaacctc 

caccatgcca 

tggtgtgact 

ttgctgagag 

gtcacgctga 

cttgcagagc 

ggttgtcagt 

cgcccggtcc 

aaatcttaaa 

cttgctcaaa 

ctagcactca 

cccatctaag 

agattgtgga 

tctgtagtaa 

ggaagagcca 

gtagacactg 

tggcattctc 

ggtcaagaat 

ctctgtgtag 

cctaagcgcc 

tctactctgt 

tgagatcctc 

tccctgatag 

cctctcttcc 

catccttctg 

nnnnnnnnnn 

tagtgtcaac 

ttccacagaa 

gtgtgtgtgt 

tcagtctcct 

ggtgctttac 

ttctggcttg 

aggaggacta 

tactcaccac 

taatttcaca 

ggttagtact 

ggctggagag 

acccaggtgg 

actcacaggc 

accaggcaag 


caggctgagc 

catacctttc 

aactaagcat 

agagaccggc 

tgactgtact 

gacagtttgc 

ggtttgctta 

agaaggctgc 

gacaggatgc 

aaacatcgtg 

aactgtctag 

gtcttggggt 

ctttgcttcc 

ccggctgcag 

ggaatagatg 

tccagggtct 

cttccctgtt 

tggcactgtc 

ctgctatggt 

aaggataagg 

ctttcctgga 

aaaagcccgt 

aaatgagcga 

tccactttca 

atataggcta 

gtcacgtgat 

aggaactgtg 

gcagcagagg 

gacactgaga 

gctcatgcag 

cgccaggaga 

tgcagaaaac 

ttacacaaag 

taagtcatca 

ggagg°tgag 

agatcaccat 

ttttctgtct 

ctgtgggaaa 

cagtggagag 

gagtcttcaa 

attagaaata 

gaagccttcc 

ccttggctcc 

ctgtgcatta 

atccccagat 

gtgtcccagc 

attcctagta 

tgcagggttg 

cctttagnnn 

nnnnnnnnnn 

tttcaaattc 

cccatccagc 

gtgtgtgtgt 

ctttccaccc 

tgagccatct 

ggtgtgatag 

acagaattgg 

atggcaagac 

taaagtcaat 

tggtttcaat 

tcagttaaga 

caatcacaac 

agtggaacac 

gtggcacacg 


59940 

60000 

60060 

60120 

60180 

60240 

60300 

60360 

60420 

60480 

60540 

60600 

60660 

60720 

60780 

60840 

60900 

60960 

61020 

61080 

61140 

61200 

61260 

61320 

61380 

61440 

61500 

61560 

61620 

61680 

61740 

61800 

61860 

61920 

61980 

62040 

62100 

62160 

62220 

62280 

62340 

62400 

62460 

62520 

62580 

62640 

62700 

62760 

62820 

62880 

62940 

63000 

63060 

63120 

63180 

63240 

63300 

63360 

63420 

63480 

63540 

63600 

63660 

63720 
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cctttaaccc 

ggtctacaga 

aaaaaaaaaa 

tcccagaggt 

aacgcaatct 

ataaaataca 

cctttaatcc 

ggtctgagta 

gggccctagt 

ctgacacttt 

acacaaacaa 

agtacgtggg 

gatagtcttt 

aaactgcaaa 

ctaaaatatt 

gttataaaag 

aattcagaac 

aaatacctcc 

tgtcccaggc 

aaaacttgcc 

agtctctaca 

cacctgtgat 

cctccaacta 

gaatgttgcc 

gatttgggct 

tggtagcaaa 

aaaaaacaaa 

agttcagcca 

cgactgctct 

atccgtaaca 

catataatca 

gttaagtgtg 

aaaaggatgc 

acagaatcca 

acgcacattc 

gcgcacacac 

atatgttcaa 

aagaattaat 

caaaaaaatc 

aacactagca 

gctgaaaaat 

cagcgttttc 

taaaagaagt 

gaaacagagg 

aaacaaacta 

tgatggtctt 

ctcccgatcg 

ctagggccat 

gtcccaccat 

ctgccggggt 

ccccagcgtg 

ccgcgcggca 

tggcgggacg 

gagttgttcc 

aatggcaacc 

agcagtgacc 

cttggcactg 

ctgagagttg 

catgtggagc 

tgaactgagt 

ttcctaaaag 

ctgatctgtt 

ttaacaccgt 

gggacccagg 


cagcactcag 

gtgagttcca 

aaggacttta 

cctgggttca 

aacaccctct 

taagtaaatg 

cagcacttgg 

aatagagcct 

gcttccattt 

tgttcgctaa 

aacaaaacac 

cttgagttca 

aaaacaaaac 

tttatctatg 

tttatttaaa 

taaagaatta 

aatccaagct 

attctgacaa 

ttttcatgat 

agatactggg 

atctcctgaa 

gattgctagc 

gccccttgct 

caaagccata 

agttactttg 

agaaactgga 

caaggcctgg 

acaaaagact 

tccgaaggtc 

agatctgact 

ataaataaat 

gagttaaaat 

tctgtggtag 

cagcggagtg 

gccatcccca 

actctttttt 

ttgaaaatga 

aacatgcagg 

tatggggttt 

agagaggagt 

tttaaaatgt 

tctctcgtga 

aaaacagccg 

ccgccgaaga 

aaaaaactcg 

agagaccccg 

acagcggctc 

cgagcccgct 

ggctgcaccg 

cccgcgggct 

agcgcggtgc 

ggcagagcac 

tgagggcggg 

cccgccagct 

cagggaacag 

tgggtacttc 

tgggataaac 

tggccgcaag 

tccagagtgg 

taaagtgtgt 

cccaaatggc 

tgtaatccca 

tgagttcgag 

cgtggtggca 


gaggcagagg 
ggacagccag 
aattgggctg 
attcccagca 
tcaggcatgc 
agtcttttaa 
gaggcagagg 
tgtacttcta 
tccactgttt 
tcatcaggca 
aacactacgg 
aggccggcct 
actattttat 
aatgaagtct 
gaaaactcag 
tttgcaattg 
tcacattcca 
ctgttt cccc 
tttagtcatt 
atatcaccct 
tgaaatacaa 
tgtactagca 
cggcttcagc 
cataagtgaa 
ccaaccaact 
actatcaatg 
caaggagact 
attccagggc 
aggagttcaa 
ccctcttctg 
ctttaaaaaa 
atacttcaac 
aggtgcctgc 
ggaagagaaa 
agatgtcata 
aaaattcaga 
ctttgaagat 
tgaagggcta 
gccggtaaaa 
gttagcagaa 
cttctgattt 
ccagagagat 
aaaacaagtc 
tagacagcgg 
tgagttcttc 
ggatgctccc 
cctccccggg 
cagggtcgtc 
ctaactcccg 
gctcaacgat 
gaggggcggg 
gtgactcttc 
gcgctgggtc 
gtcggaactt 
tcggagctct 
ttcacgcagt 
aaggtaaatc 
gaacccagcc 
gggtcaaaaa 
gtagaaattt 
ctatgcaaaa 
gcctttggga 
gtcaacctga 
ttcgaggtaa 


caggcagatt 

ggctatacag 

gagagatgga 

ctcaaatggt 

aggttacatg 

tgtatactag 

caggtggatc 

cttatcacta 

gcttaaccac 

accaatggtc 

atcctggcat 

ggtctacata 

ttatgaacaa 

aagtaatact 

caaattattc 

ccttaagggt 

gttcaacatt 

tcttcttttc 

cacacaacta 

aaagggggac 

agtagctaat 

gaagctaaac 

ctcattttta 

caaaaaggag 

cagtaacaac 

gcctctagag 

gtgagaagag 

tggtgagatg 

atcccagcaa 

gagtgtatga 

aaaaaagact 

tggtgatgga 

tgtgtaagcc 

caacgtccca 

catatgtgta 

cttagaggga 

agagggcaga 

taaactagtc 

taaggaaaaa 

agaagccaac 

ggctactggg 

gtctagtagc 

caaaaagttt 

catgtttatt 

tgccagtacc 

ccgcggccgt 

tcctctttgc 

tccttacctc 

cgctcgcgct 

tggctagagc 

cctagacgcc 

cgaggccggg 

gcagtgcgcc 

tgcccgccca 

cccctggtaa 

gcgtgcccgg 

agactcagtc 

tcaaggatgg 

tcaatcaggc 

gtcaggtgga 

gtattggaga 

tgtagaagca 

actaaatgag 

aagcaggcag 


tctgaattcg 

agaaaccctg 

ttaaaagcat 

ggctcacaac 

tagacaaaac 

aagctgggtg 

tctgagttgg 

cagttacatt 

tggggcctga 

tctacactcc 

ggtggaacat 

gcaagttcta 

aacatgtaaa 

tcaatattgg 

aaacaacctt 

ccaaggtggc 

tctacagccc 

ttctaagctg 

gcaaacatta 

tgaaagtagc 

atttaccaaa 

actaaatcta 

caaacaaggg 

ttggagtctc 

tgagctgaac 

caaaaatata 

tgtgctgact 

gctcagtggg 

ccacatggtg 

agacagctac 

attccagtgg 

ctaggtgtcc 

cagctacctg 

gagttgtcct 

catactacac 

cataaaggat 

tcgaaggaag 

tgcagagggc 

gttgtcaaca 

aagctcaaac 

aagccactgg 

aataatgagt 

ggggtgatgg 

tgtcttgttt 

gggttgcctc 

ataatttcct 

accgctccaa 

gatggccccc 

cttgcaccgc 

aactgtgcgt 

gatagccacc 

ttcgaggcct 

tgtgtcagcg 

gtcctttggc 

ctgctgctaa 

cgccggtgcc 

tccgccctct 

tagacgcgat 

tttcgagagg 

ttccagtgag 

gcctggcgtg 

gcaaaagttc 

accctgaaaa 

atctctgagt 


aaggcagcct 
tctcaaaaaa 
tggctgctct 
tgtctatatc 
atccatatgc 
gtggtgcatg 
aggccagcct 
ttataacttt 
agcttttgtg 
atcaccatca 
ctttagc ccc 
ggatagtagg 
gaaagaaaaa 
aaatagcttt 
ataaacgttc 
agcctcttaa 
taacgtattc 
cttagatgtc 
tctagggact 
tgcaggctac 
taaacatgta 
gaaactcagt 
aaagagtttg 
caaatgcatg 
aggaacactg 
tttaaaaaga 
gaaattgact 
taagagcacc 
gctcacaacc 
agtgtactta 
ggatggaaaa 
agagtcgggc 
agctcaatcc 
ctggcatccg 
actggcgcac 
ttgctctgat 
ctcagcagga 
cttggctcga 
tgaaacacag 
aattaggtcg 
tgacttcggt 
taggaggatg 
agaaagggag 
tcttagatgt 
cagcatcctc 
ccctgacgct 
ggccgcgctg 
tcgctcaggt 
ctgagcttct 
gccgatccgc 
gcattggcta 
agtggcggga 
cggtgctact 
ggacagacag 
atatagtcaa 
aggcccagag 
tgagttccac 
atgggccaca 
cgatgcggtt 
gatagtgatg 
ctggctggct 
aaggtcaccc 
atcaaaattt 
tcgaagccag 


63780 

63840 

63900 

63960 

64020 

64080 

64140 

64200 

64260 

64320 

64380 

64440 

64500 

64560 

64620 

64680 

64740 

64800 

64860 

64920 

64980 

65040 

65100 

65160 

65220 

65280 

65340 

65400 

65460 

65520 

65580 

65640 

65700 

65760 

65820 

65880 

65940 

66000 

66060 

66120 

66180 

66240 

66300 

66360 

66420 

66480 

66540 

66600 

66660 

66720 

66780 

66840 

66900 

66960 

61020 

67080 

67140 

67200 

67260 

67320 

67380 

67440 

67500 

67560 
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ccaggctaac 

ggctatatga 

atgaaaatat 

aggtaaacag 

tgtacaacca 

tggtaccaca 

agaaatgatt 

cattaataga 

accggtggat 

gcaggacaca 

tcttcagtaa 

catttggtct 

gggcatttag 

ggtaggcact 

gctgtcgtcc 

agctcattca 

cttctcaagt 

nnnnnnnnnn 

tccaagtgcc 

tctgtttttg 

accaggctac 

cactaccacc 

ttatacgtgt 

ggtgaaaagg 

tagctgcttc 

attgaagaag 

cagtattgat 

atgtgctgag 

ctcgcctgca 

aaccaggtga 

tcaccttgcc 

agcaccagcc 

ctggtgccct 

agcagctcga 

ccaatgccat 

tcagcagcag 

ctcggtagct 

gggatttgca 

gcctagattc 

atgtagagtt 

tgaggaagcc 

tccctggaat 

ccgatagaga 

ggcatttttc 

ctgctgagag 

cccaggccca 

ccgtgtgttc 

cgctcttctc 

ttttccggaa 

acagaagagt 

tgagctcttc 

ttggccaggg 

acagtgcttt 

ctgcttcact 

aggtctgcat 

ctgtctgctt 

tcggcccttt 

gattgcaaat 

ctagtaaagg 

gctatgtgaa 

tcagttggaa 

aaaaaataga 

tgaggtcact 

cagccttgat 


ataagatccg 

actgaaagaa 

ttaacttcat 

gtctagaaag 

caacctacta 

ccacagatta 

ttggttgcgt 

aagctgaatc 

gaatggaagt 

atttctacag 

gtgaatggaa 

gaatatctct 

cagttactgt 

aactgctttc 

cagtgctgtc 

gtcctaatgg 

tctnnnnnnn 

nnnnnnnnnn 

actttactgc 

agacagggtt 

cctcaaattc 

acccatccag 

gtttatcatc 

attacaaacg 

tttcattaga 

tctcttcctc 

aactgttctc 

ctgtagcaac 

ggaccttcag 

gattggctcc 

ttactgagca 

acaagatggc 

tgaaagtctt 

gcaccagaaa 

catctgattc 

cagcaagtcc 

agcctgagct 

gaactcgatg 

tgaagttgac 

ggttgttaaa 

tcatctaaca 

gcccggctgt 

ccttgcaacg 

ctttgaatcc 

cttcttgtca 

tgctacctta 

agtgcggccc 

cctcccattc 

tgagacattt 

ctccaggtgg 

aaactggccc 

cacacagtgc 

aaagcatgtt 

gatagacttt 

ggagcttccc 

aatcacacct 

ccctggtttg 

ggcatttggc 

ctaaacctac 

ggctgagaat 

gcacccttgg 

atacaaggaa 

tgaagggaag 

caaatctcat 


gtctcaaaaa 

agacctggag 

agttgctgga 

actggaatag 

tagtttctca 

actctccttc 

gtcacttggt 

ttcatgttct 

cctacctact 

cagagacttt 

acttcaggga 

tttttatagg 

tggttttcac 

ctcttttggc 

actcatagga 

ggagaaagaa 

nnnnnnnnnn 

nnnnnnnnnn 

aatgtgtcac 

tctctgtgta 

actgagatct 

ctttctatat 

taaagtcctg 

cttatcaatt 

attgctatca 

atcaggtcag 

gcttaggaat 

gaatcttgca 

caacaggtct 

ctcgccctag 

gtcctcaata 

tatgtaggcc 

cctgatggtt 

ggctgtctga 

ccagcttaag 

tgagtgctgc 

gagagctgag 

ttagaccctg 

ctttagccta 

tgttggttgg 

cctcttttac 

tgactccgtg 

tggtctctcg 

atagagcaag 

gcagagcaca 

ggtatgctac 

ttgccttgtc 

tgcattcctt 

ctcttctcac 

ttgaaactcg 

aagtcctatt 

tctgtgtttt 

gaaactcttt 

aagctttgaa 

agggaggatt 

cccttggctg 

cctcttattc 

agtgtttaga 

agtgcacagg 

ccctgctttg 

tggcatgcac 

gagtaaccct 

tctagcccca 

ggtatacatt 


aaaaaagtaa 

atcaaaacag 

gtaagaagtc 

tagccatcta 

aacagttcca 

agcatatcaa 

aggatgaaat 

gactttacag 

taagaagtgg 

gagcgacatc 

aattttggtc 

agagagtagc 

gtttctccct 

atgtgttata 

tctggtgcag 

ccctggcact 

nnnnnnnnnn 

nnnnnnnnnn 

cacatccaga 

gccctggctg 

gcctgcctct 

tggttttcct 

gtcccaaaag 

ctgttcaaaa 

gaattcactc 

tgactcctta 

aaaaatagaa 

caaactctgt 

gcatggccca 

gcctcagccc 

aatgactgag 

agtgagacca 

taagctgtgt 

cttccatggg 

gacctaagct 

ctgggctcgt 

agaaagaaag 

acatggtagg 

gagtcaagaa 

aaaatggatg 

tgaaagagaa 

agataaagag 

tgtccaggac 

ccattcagca 

ccgtactggg 

cttaggtata 

tgtttggttc 

gcccccagag 

agggaactgg 

ccatgggcca 

caatgggtag 

cctaggtgct 

tatttcctct 

ttccttcctg 

tggaggcagc 

cctcagtccc 

cttttaaagc 

gacagttttg 

accgcctcca 

gagattaaaa 

aaggccctgg 

aatgagctgg 

atttagtatt 

ggtaagaaaa 


taaaaataaa 

aaaactgagc 

tggaaaactt 

ctggtatttt 

aagaatatgt 

cagctataga 

ctcgattttc 

agctgcggca 

gcttcactcc 

ttccttccit 

tggaaaatgt 

tttatattct 

agtctgtgat 

cttaaggaat 

gttgtgtagc 

tggttagttg 

nnnnnnnnnn 

nnnnnnnnnn 

gttctgtgtt 

tcctggaact 

gcctccagag 

atggcgtttt 

gagatgagag 

attaaacctc 

aggccttgtt 

gctcaagtac 

ctgacttcca 

cagcagggac 

gaagcttctc 

ttcccttgtt 

gacttgaatt 

gactgtgacc 

ctgctgcgcc 

ctttgtgtgg 

ccgagaaggt 

ggtgtgactc 

gactcctctc 

aatctgtttt 

aactaatgat 

ttagaagccc 

aacataagca 

gcattttcac 

tagatctgta 

gttgttgcgg 

ggaaattgaa 

gccggagttc 

tctcttgcca 

ccttaggcta 

ctaaagtctg 

tccagtgttg 

gcctttcttt 

tctgtgtatg 

ttaggacaga 

aggatgtgga 

ctcacccgcc 

tgctctctca 

agtggttttc 

attgttatgg 

cagtggagag 

aaggaagctg 

ttctgtccct 

tccctcaccc 

ttttgtggct 

agggtttgaa 


aagggagaga 

cgtctaagaa 

tgggcaacta 

gatctctgtt 

ctgggtgaat 

aaaccccaga 

tagaactatg 

gcatggatct 

cgaagtctgt 

cttcttccct 

tctgccttgt 

ttatagtatg 

tactagaatg 

gtagtatctt 

tgcccctaga 

agacccanaa 

nnnnnnnnnn 

nnntttagtt 

tgtttatttg 

cactctgtag 

tgctgctgta 

aaaatagcca 

gggctgctaa 

agagtgggat 

tgcgtgtgtt 

atgcaatatg 

cagggaaatg 

cagctagtct 

cgaaccggta 

tattttggta 

taattatccc 

agctgttact 

agatagttct 

ctccagaggt 

tgctctgccc 

aggagtagag 

ttttcagaaa 

gactattcta 

tacaggagga 

agggtaaatg 

accaacagct 

tttgacctaa 

tctgttgtga 

tgccgggagg 

gatggcctgg 

tccttccctg 

tctgaattga 

atggtgtttc 

ctgcccatgt 

aaattggcca 

ttcctagtgt 

gctttttgct 

tatttgccct 

gaaagccatt 

tctagcattc 

actccagggc 

aactagaagg 

ctgccagcat 

acccaagtta 

agggaaccac 

agctctgcac 

agtgtgccac 

gccatacctc 

acatagacct 


67620 

67680 

67740 

67800 

67860 

67920 

67980 

68040 

68100 

68160 

68220 

68280 

68340 

68400 

68460 

68520 

68580 

68640 

68700 

68760 

68820 

68880 

68940 

69000 

69060 

69120 

69180 

69240 

69300 

69360 

69420 

69480 

69540 

69600 

69660 

69720 

69780 

69840 

69900 

69960 

70020 

70080 

70140 

70200 

70260 

70320 

70380 

70440 

70500 

70560 

70620 

10680 

70740 

70800 

70860 

70920 

70980 

71040 

71100 

71160 

71220 

71280 

71340 

71400 
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gatactcgga 

ggcaatatca 

ctggataagc 

ccagagtcta 

tgtctcttat 

gtgccccctg 

aggagtccac 

ggttggcagt 

actgtggccc 

actcgtcatt 

gagctctggt 

ccttcccacc 

cactcacctg 

atgcccagaa 

cgggcttggt 

caaggccagc 

tcttgaaaaa 

aaagggttgt 

tctcactaaa 

gtgctgtgtt 

aatatatagc 

atatcctgct 

ataaatacgt 

accagtgtcc 

agcactgatg 

acctgaaagc 

gtctagcctc 

ggccaccttg 

aagtccaggg 

atgagtctgt 

gggaagccgt 

cactccaggg 

ctcattggta 

tgctgctcat 

ggtagtaata 

gtacattatt 

catacactct 

tgtcatggca 

agaagctggt 

gagtctcctg 

tttggggtag 

tagtccaaac 

gtgctcttcc 

aagctttttt 

ggaagtcctg 

tgatgccctc 

caaagcactc 

cttgagaata 

ctattatctc 

aaaacacctg 

ctgcttctcc 

tgtaactcca 

attatacaga 

nnnnnnnnnn 

nnnnnnnnnn 

ctctctatta 

atccactact 

ttcaagatct 

aaaacagaaa 

caagttgtca 

gggctgtagc 

tggggtttga 

aatgacatat 

aggtctctct 


catggaaaca 

gagagagggc 

ttttataaat 

tcacaaacat 

ctatactttc 

taagaccctg 

tttcgttgag 

gcctgaagct 

tgagagtgag 

cgtccctgtg 

gatgactgag 

tgacctgagc 

ccatgctggc 

aagactaaat 

ggcacacgcc 

ctggtctaca 

ccaaaaaaat 

cactgatctg 

cgatgaatct 

ccactccagt 

aagaaatcag 

tccttaaatt 

cttactgttg 

taattcttac 

caaggcaacc 

agaccactct 

aaacaacatc 

cccagcttga 

cactccagga 

ctgcagggca 

gagtctgtct 

ggagccatga 

gactgtgagg 

ggtttccaac 

gtccagacca 

gctgatgacc 

gctttttaga 

ttacagtgag 

gttaaaagac 

ctgtctgtta 

agctttttca 

tcatgccttg 

tcaaaggata 

tagatgcttt 

ggtttgattc 

ttctggccta 

atacataaaa 

cctgtgctta 

agcactcaga 

tctcaaaagt 

agagggcctc 

ggtccatgag 

gagagagaga 

nnnnnnnnnn 

ntctagagtg 

tgctgcctgg 

tctgcctctc 

tagatttcca 

gaaatcaggt 

aaactttgct 

caacttggtc 

attgatttga 

tacaaggtgt 

gctcaagctc 


gtatgtttgg 

atcagtcggg 

gctggttaag 

acacagtttt 

agatggtaag 

ggggatgcag 

actaaggaag 

ctgagtgcca 

gaggtccggg 

gggcccttgg 

atgttctgtt 

ctagtgtctg 

tgctgcttgc 

ccagtatcta 

tttaatccca 

aagtgagttc 

aaaaaataaa 

caggcagctc 

gtttccctgg 

gagaacttca 

tatgactgct 

tttcattaag 

gcagtttctt 

tcagtggctt 

ctgtagcatg 

ggctgtttca 

ctcctgcctc 

gactgcttct 

ggagccgtga 

ctccaggagg 

gcagggcact 

gtctgtctgc 

ctacatctcc 

tgagacagag 

caagggacag 

agtttgtttg 

acttgctcat 

gaaataaaca 

agtattcaaa 

atttaccctg 

ggaaaaaagt 

tgttttattc 

tatttaacgt 

gtaagatggc 

tcagcagcta 

tgtgagcact 

ctaaattcaa 

tattataatc 

agattgagac 

aaaagggctg 

atttcagttc 

atccaacacc 

gagagagaga 

nnnnnnnnnn 

tttcaggttt 

aactttctat 

agtactggta 

aagaagccgt 

acacttagag 

cttaattcta 

gagcccttgc 

acccagtaga 

ggccttgttg 

tgcccagggt 


tcagagagag 

ttagacacga 

gttttgaatt 

ctcaaatctg 

accctgaggg 

tggacccgac 

ggaactgaca 

tctggaaggc 

agctccggga 

tgtagagcaa 

ggcttggagc 

gcacagagag 

aagagctaac 

aaggctgctt 

gcactcggga 

taggacagcc 

aataagtaaa 

atgctagcct 

aacattccta 

attcttagtc 

tacctcaaga 

actggtgata 

cctgggatgg 

tcattgagtg 

acttcatagc 

cttgcagaca 

agccttccaa 

tactgccatg 

gtctgtctgc 

agccgtgagt 

ccagggggag 

aaggcattcc 

agatgccccg 

ggaaggactt 

atagctatgg 

gatgagttag 

cctagccatg 

gtcctgaagg 

catctgcgga 

acaaggaggt 

ttagtcctac 

ctaaaagccc 

tttatatatc 

tcagtggtta 

acaccagctg 

gcatgtgcgt 

aaaactcttt 

tagtatgagc 

tagatggtca 

aaaaagtgtc 

ctaataccca 

tctggtcgtc 

gnnnnnnnnn 

nnnnnnnnnn 

tttttgtttt 

gtagaccagg 

ttgaaggcat 

agcctagaaa 

ctgtagaatg 

aagagagaag 

atgaagctgt 

ttcgtatatt 

gaggaattgt 

agaaagggag 


cgaaggacct 

gcattccaca 

tgcccaattt 

ctttgcagta 

cagaggaact 

tttgtgttct 

agcttccctt 

agaaaggatc 

aatgaggtcc 

tcatcctcac 

tctcatcaga 

cacttgaaaa 

tgccctctga 

taaagggttg 

ggcaggcgga 

agggctacag 

aaataaataa 

aggcttttgg 

tggtttctag 

ttgtattata 

gacatacaat 

tataatttgt 

caactctgta 

ttcttggcac 

acaggcctcc 

gaatcttact 

gttctaggtt 

tctcttcagg 

agggcactcc 

ctgtctgcag 

ccatgagtct 

aagagcagcc 

agtgctgtgg 

tgcccctttc 

ggttttctga 

tgggaaagaa 

cccaaggagc 

tgcctggcag 

ctgggaactg 

gacttgaaag 

agacaactct 

tgtcacactt 

aggccttatt 

agagcatgta 

ttattccagt 

agtgcacaga 

cattgtctca 

cagccacggt 

agagctagag 

tcagcagcta 

caccgagtga 

tgcataagct 

nnnnnnnnnn 

nnnnnnnnnn 

tttttttttt 

ctggactcaa 

gtgtcaccac 

aggttaataa 

tcagcatgtg 

ctgtcaaaag 

gtgtttactc 

tgaatgttta 

caatttgggg 

cctcctcctg 


gatagacgag 
gtgagcagct 
tgtcaggatc 
tgcccgtgaa 
cagacccttt 
ctgcacagaa 
tctggcttca 
ctgttggtgc 
tgcttactgg 
cctcaagaag 
gaggacggga 
cagattgaga 
tggaaacccc 
tcactgcagc 
tttctgagtt 
agaaaccctg 
ataaataaat 
ctcgatttca 
tagtaatgaa 
attgaaaaat 
tccacttaca 
gaatggagaa 
ttggtttcct 
tcactgtcca 
ttgttagcac 
ctgtaagcca 
tataggaaaa 
ctcacacatg 
agggggagcc 
ggcactccag 
gtctgcaggg 
atgggcgtca 
ttgtgagcac 
cctaaggatg 
ctcatcctta 
gacccaagtc 
agccgttgac 
cagcttttca 
ggcagcattt 
gtttgttttg 
atagttattc 
tgtaaaatag 
ctgtgcatgg 
ctgctcttct 
tcctgggatc 
caaatgcagg 
tgtgacctag 
agcaacacac 
tctgggttac 
agagcacaca 
ctcaaccacc 
cctacactca 
nnnnnnnnnn 
nnnnnnnnnn 
gagacaaggt 
acttatagtg 
accccactac 
gtactgattt 
acatttgtga 
acttgaactg 
cccagcactg 
cc tcatgggg 
gtgagctttg 
gctgtctaca 


71460 

71520 

71580 

71640 

71700 

71760 

71820 

71880 

71940 

72000 

72060 

72120 

72180 

72240 

72300 

72360 

72420 

72480 

72540 

72600 

72660 

72720 

72780 

72840 

72900 

72960 

73020 

73080 

73140 

73200 

73260 

73320 

73380 

73440 

73500 

73560 

73620 

73680 

73740 

73800 

73860 

73920 

73980 

74040 

74100 

74160 

74220 

74280 

74340 

74400 

74460 

74520 

74580 

74640 

74700 

74760 

74820 

74880 

74940 

75000 

75060 

75120 

75180 

75240 
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gaggacatag 

ccaagtctgc 

aactgtaagc 

ctcttcataa 

cctttaatcc 

ggtctacaga 

aaaacaaaaa 

ggagtacaaa 

tcaaaacgaa 

tcttgctttg 

gcattctcca 

gcctgcctgc 

agacatgcct 

acaaatggtt 

agccagtgct 

tggttggttg 

tggaacttgc 

ggataaagtc 

tcattccagc 

tgatcaccct 

tctgagtgtc 

gagaatgagt 

gggggggggc 

acagcctgca 

ccacctccat 

gccttaggat 

tggagggtcg 

gagcatcctg 

aaagcaacct 

cacagaccat 

ataagtgggt 

taatatgctg 

gccccagggc 

aaagcaacgt 

aggggaggcc: 

cagttcttga 

tgtattgtag 

gctcatgggg 

ccagccccac 

catatgctac 

tcaccaccat 

ggggaagccc 

catttccctc 

ctcctgaagg 

ctccagaggg 

atgagagctg 

catcttcctt 

tggattccac 

cctctttctg 

cctgagcaca 

tgaagcaact 

ttatgagttg 

ctactgactc 

aacagaggca 

gtagagagcc 

gatgccaagc 

tccttttgtc 

tcttcagggc 

aggttatcag 

ctaccctgtt 

taaatgacta 

agtggtaggg 

gcatggaaac 

cacccagaga 


tctcctggct 

ctgcacaatg 

cagccccaat 

caataaaagc 

cagcacttgg 

gtgagttcca 

caaacaagca 

gtcgtctcag 

agaaacgaat 

tttttggcgg 

ctgaggaaca 

ctgcttgtta 

ggtatctgca 

gtaagctgcc 

cttgttgttg 

gttggttggt 

tctgtaggct 

atgtgccacc 

ccttctttgt 

ccctcagcct 

agatcttgaa 

tctgataact 

aagccccaga 

catctcagtg 

ccctacagac 

ctgacatcat 

tctttctcga 

agccactgtc 

gagctgtcag 

aggctccatg 

gttctctctc 

gtgtgtggta 

tttcctctaa 

aaaatggaaa 

tgtggaaggg 

gcaggggtgc 

agtgtctggg 

gcagagccag 

gcagccattt 

aagcactgtg 

caccacacac 

tcacccttga 

gtggaccggg 

tctactcgag 

gcaacaaagc 

aatcagcacc 

tgacttgctg 

cgtttgcatt 

ttcctctgtg 

cacagcctat 

tcacgccagg 

ggggaccagc 

ctctcaaaca 

aggagctcct 

c ggtgaatga 

ttcataactg 

ctctgagagc 

gtggaaagaa 

agctacagag 

ctgaggagga 

aaacacgata 

attgtgtgtg 

tctgagtggg 

aa gggcagag 


gccttcagat 

ccatgcttcc 

taaatgtttg 

ctaactaaaa 

gaggcagagg 

gaacagccag 

aacaaatgcc 

ctagtatgtt 

gaatagataa 

tgctgaggat 

ccctggcgag 

tgtgtatgag 

gaggccagaa 

atgtaggtat 

gttttattgg 

tttcttgaga 

caaactcaga 

aactccagac 

gttttgagat 

cctgactgct 

aacccattct 

atttctcact 

aacattgata 

gccatctgtg 

cttgcagttg 

ggggactctc 

gtcaaacttg 

cccagaaagg 

cagcaagaac 

caccacgggt 

tgacttggat 

cgtgcctgct 

gttaggatcc 

ttaatcaaac 

tctcctgagg 

gactccagga 

agggagctgt 

gattaaacct 

cactggtcaa 

gccttcagat 

acacacacac 

agtaagccac 

caatcttccc 

ggttgcccgt 

tggatttgac 

ctcactttga 

gggcatctgc 

ccccgtccca 

gtctcactgg 

tgtttagaca 

cataatgggg 

tgagactcta 

caattctgga 

tgtaggtgct 

cagactctgt 

cccgtgaggc 

ccgaggaggc 

ataatgaccg 

agaagagtgg 

gccagctctg 

ggaggcccict 

tggcccccac 

ggaaattcag 

agggccgtgg 


caagatgtag 

taccatgatg 

tctttataag 

cacattcctg 

caggaggatt 

ggctacacag 

agcatttggg 

tgaggccagc 

acatttgagt 

taaacccagg 

tgcctagtct 

tggtaacctg 

gaggatgttg 

tcagaattga 

gggcaggagg 

cagggtttct 

gatctgcctg 

aagcagccag 

ggtcacaaag 

gggggttaca 

cgtgaccttg 

cctcttcaag 

acttgcccaa 

ccatatccac 

agatcagagt 

acggtcctgt 

gttacccact 

ccacaggtcc 

acaggagccg 

tctggctatc 

caccacgctc 

ctcctgtccc 

actcttgcta 

cattcctgcc 

ccttaccact 

gttgggggct 

gttatcctta 

ggtctgattc 

ctaattcaga 

ctccctctgg 

acacacacac 

gccaaggaga 

tttcatctgg 

gaggatccgg 

tcgaagttct 

aagcatgcca 

cggcttgccc 

ggccctcctg 

ctctttcctg 

tcatggaagc 

cgtgcctgcc 

tagactttga 

agcactcttg 

gtggatgtca 

tcgatgttca 

ccgcatcagg 

cgatattgca 

ccttgctagg 

gatgtggtcc 

ctcacggcag 

taaggaacca 

cgcccctttc 

gtccacctgc 

ccacgctgag 


aactcttggc 

ataatgaact 

agttgccttg 

ctgggcagtg 

tctgagttcg 

agaaaccctg 

aggtagagtt 

atggaccaca 

gtccagtttt 

accttgttca 

gtctgtctgc 

catgtctgtc 

gatcgcctgg 

acctggtgct 

tagttatttg 

ctatgtagcc 

cccctgcctc 

tactcttaac 

tacaactcag 

ggtgtgtcac 

atccttaaaa 

aaaaggaaag 

agttacacag 

cctgcccttc 

ccaagccgta 

cctcgtccaa 

gccctggaag 

acgctgtgcg 

ctgggtccca 

ctcctgtagt 

cttctgttta 

ctggggactt 

ctactccata 

acaagaataa 

gtctagaagg 

gctctgagag 

ccgtgaagat 

aaaaagccag 

aacacttggt 

cctggtacct 

acactcggcc 

aggatgctga 

gcctgcaccc 

ggcctggtcc 

tgagaggaaa 

gaccccagct 

agaccctggc 

ctgtctccca 

cccaccagtg 

atacagacaa 

cttcagaagc 

gaggtggggt 

aggttcttct 

gggttggtga 

gctcctgcca 

atagggacgt 

gatgttttag 

aagagctcta 

tgaagttctc 

ctgtacccct 

aggtcgagtg 

ctgttcctct 

agccttcttc 

tggagacagc 


tcctccagca 

gaacctctga 

gtcatggtgt 

gtggtgcacg 

aggccagcct 

tctcaaaaaa 

aagaagattg 

tgagacgttc 

ttcctttctt 

tactaggcaa 

ctgcctgcct 

tgtataccac 

aactgggatt 

ctgaaagagc 

gttggttggt 

ttggctgtcc 

ccgagtgctg 

cactgagcca 

actgagctct 

tgtcctcaat 

caaaccctgg 

ccagagaaag 

caaaattcag 

tctgacctcc 

tcgtaagatg 

atgaaaatcc 

gaaacagatg 

tccactggcc 

gcatgtgtgg 

aaactcagaa 

aagtggcctt 

ggagtaggaa 

agatggtcac 

aacagatctc 

aagttgacag 

caggacagca 

gaggacacgg 

agatctgtgc 

ctgatatgct 

gcattcaggt 

agagacaagt 

gggcactggg 

agttgttaat 

cataggactt 

cagatcaaag 

tcctgctcag 

tagggaacag 

gagcccactt 

ccaggcctcg 

cccaggccaa 

agaggcagct 

gggggtgggg 

caggggcagt 

tcaggtcgta 

gagagaaaag 

tagacatcaa 

ctggacaaga 

agacagggca 

ccatcgtaac 

agaacctggt 

ccacttacaa 

gacggcggca 

agttgacact 

aggac'gttgc 


75300 

75360 

75420 

75480 

75540 

75600 

75660 

75720 

75780 

75840 

75900 

75960 

76020 

76080 

76140 

76200 

76260 

76320 

76380 

76440 

76500 

76560 

76620 

76680 

76740 

76800 

76860 

76920 

76980 

77040 

77100 

77160 

77220 

77280 

77340 

77400 

77460 

77520 

77580 

77640 

77700 

77760 

77820 

77880 

77940 

78000 

78060 

78120 

78180 

78240 

78300 

78360 

78420 

78480 

78540 

78600 

78660 

78720 

78780 

78840 

78900 

78960 

79020 

79080 
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ccgttgggct 

ctgccagctt 

tgctgtccgg 

agtacatcac 

gaatgagtgt 

gcaacacaaa 

tttaaaataa 

gttcaaggac 

acactgggga 

tcagcgtatt 

ttgactacta 

cacccctctc 

aaacccaccc 

gttctgctgt 

gaggaatttg 

gcttttttgt 

aatgcttggc 

tttttagttt 

gacaggattt 

cttagattca 

ctgtacctac 

gtaacatgtg 

gaataagtga 

acttaaagcc 

cccaccccta 

tattcttgtc 

tggatccagg 

acctcagagc 

aggagtagca 

ccctcaccca 

gcagagtgag 

tcttccgtcc 

atcaggagac 

tcaggcccca 

cggccagctc 

atagacttac 

ctatgttaaa 

tagccttgaa 

tgagccatca 

gggtatgaac 

nnnnnnnnnn 

nnnnntctta 

tgtgtcttcc 

gagggggtag 

taaggaagtt 

gtgagatggt 

tctgtgggta 

tagtggggcg 

caaggggctc 

actcagtagc 

acgccatccg 

gccgagagat 

aggccaggaa 

gggccccttc 

ctgacctgcc 

gtttcgggga 

ctggatgtgg 

gctggtcggg 

ggcctggaag 

gggagtaccc 

ttctccaggc 

tgggcacggg 

aaggccttgt 

tgctggaggt 


ggcactggat 

gttcacaggg 

gactggggag 

ctcctggttg 

gtgctctctg 

agggcttgcc 

aacttgtagg 

ctgggttcag 

ggcagagatg 

ccatctcagt 

gcctactctg 

tgtccccttc 

tcagaccagt 

gacgccctcc 

tcgttttctg 

tttgttccca 

cacccagctg 

tctgtttggt 

cgctatgtga 

gagatccacc 

caacctcaaa 

tgagggcctg 

ctggctagtg 

cacaatccag 

cttcctccgt 

tcgatgttaa 

gctgcctccc 

tgctggggac 

ccagggcctg 

caccctcccc 

cccttcccct 

ctgacctgcc 

ccaccttccc 

agacaatgct 

ctttggtagg 

agaacatctg 

gacacacaca 

gttgcagcta 

tgcctgactt 

cagggccaca 

nnnnnnnnnn 

tttttagctt 

atttgcttct 

gactcaagaa 

acaggtgggc 

gcaagatttg 

aggatgacca 

gagccccaga 

ggagactagt 

cgaacttgat 

ttccatggct 

aggggcgtgg 

gccccaggca 

gaggtcccgg 

cagaccatgg 

ctgcactggg 

cggaacattt 

tgcggaactt 

actgaagttg 

agtgggagaa 

tgtggcagtt 

tcctagtagc 

tggaccggga 

ttgctgtgag 


ctcaggcggt 

accttgaaga 

ccctgggtta 

ggttggtcct 

actccacaac 

agacctgaac 

gctggggaga 

tccccagagc 

ggcaggtcct 

gaggggtcct 

tgtctgtttc 

cctctgcagg 

cttccctatt 

ctgtctttcc 

acctgttttc 

gggctttgga 

tacctcccgt 

ttttggtttt 

ctctagctgt 

tgcctctgcc 

ggaagtgtcc 

ggcttcaccc 

gcaagacagt 

gtctaatatg 

atagtggcag 

agtaaccaag 

tgcactgtct 

gttctggctg 

catcccggaa 

tgccactgca 

ccctccagct 

cctgcacttc 

atctaagctc 

ctagcacaaa 

atttggaacc 

gtctggtcct 

cactttcttc 

ccctacttca 

gactcccttc 

ctgccnnnnn 

nnnnnnnnnn 

gtttgttttt 

tctacctaat 

gggattctcc 

agagctcggc 

ggccctcagg 

gtaggtgaga 

ctagtcaggt 

ggtgttggag 

gccagaaagc 

tcacccgagg 

acaagccagg 

cagcagggtg 

gcaccgggtg 

catcctgggg 

ggctgcagct 

tcgtttcgct 

ggggactgac 

gaaccttttg 

tggatctagg 

tcctcaaatg 

agtgtcagtg 

attctaggtt 

gaattaagga 


acagatcgta 

aggggtcctc 

gaatacaagg 

ctagtccctg 

ctgtgtgctg 

tgtggtctga 

tagcacagta 

tgggcacggg 

ggggctcatt 

gtttcagagg 

tgtctgtctg 

gaacttcctc 

cagcttgctg 

tgacaagcca 

agtgtcagcc 

tcaaagctgg 

ccctgttgct 

gattttttgt 

cctgggactc 

tcctaagtgc 

atggatagag 

ccaacagaga 

attgtctaag 

cccatctgcc 

ctcctctgga 

gcctagaaga 

cttctgctgt- 

ctgccgtgcc 

ctacaagaga 

cattgcacac 

ctcagcccca 

tccttgagct 

cagcacaggg 

cacacaccaa 

ctgtctcagg 

gctatccacc 

taggtaggtt 

tttgcctcct 

ttctatttca 

nnnnnnnnnn 

nnnnnnnnnn 

cttacttgag 

aaagtttctt 

cataagctgt 

tctgcctgac 

agttggctct 

gatgagggaa 

gggggtaaac 

aagacctcta 

cccaaactgc 

tggccccatg 

aaaagttaca 

gaaaagctag 

ccaggatcag 

actccttgtg 

gctgcccacc 

gatacacagc 

aaagcactga 

gaatggaact 

tctgggagaa 

gtcaccgacc 

ggtgagtgta 

cctcccccat 

cctgagaaaa 


gccgaagttg 

ctcctccacg 

accagtaggg 

gggccatgag 

ggaggtgggg 

gaacctgaag 

gataaagtac 

ggtgcatgct 

ggccaatcag 

gcctgaggaa 

tctgtctgtc 

accaccacca 

gctggtccta 

tcccctctga 

tcttccttat 

gctcttacat 

tttcggtttg 

tgtttgtttg 

actatgtaga 

tgggattttt 

ttcagtacta 

agggggagtt 

gtcactaagc 

ttgtccttgt 

tcctgaaagg 

gtgaaggcca 

cccacctacc 

cgagcagggc 

aacaagagag 

tgcacaggga 

agtggaccct 

gtgcccccat 

aagacccagc 

ggcttttccg 

atgggagcag 

aatagttctc 

cttgtgtatg 

gagtactaca 

agagcaattc 

nnnnnnnnnn 

nnnnnnnnnn 

acctggggag 

ggtgatgttt 

tccgtttggg 

tggcgtgctc 

gttggccctg 

ccagaacaga 

tagatgactt 

atgtgttgta 

taaacccaag 

gctgcgccaa 

gcacgctgga 

atcccgattc 

agaaactgac 

gctggcgctt 

ccgaaatctg 

ctactctctg 

ggggcggggg 

ggtttgggtt 

attgacctta 

caattaatga 

cagaaaaggc 

ctttggaatg 

gtgggacttg 


gagacagctg 
ggctcgcccg 
aggcacagtg 
tctgaggtca 
agtgggaagg 
cctggcccac 
cagcatgcaa 
tataatccca 
cctgaactaa 
tgactctggg 
tgtctgtctc 
acccccaaag 
gtctgcctag 
ctagacccga 
gagactttct 
acgttaggca 
gaggactttt 
tttgttttga 
ccaggctggc 
agattttaat 
catcatgtgt 
agtaggtgag 
cttaagccac 
gtgacatgac 
agagggaaga 
a agcccaccc 
caccctactg 
tccagtccag 
caaacgactc 
caagagtcag 
tgacttgagg 
gttggttcct 
agcaggctct 
tggaggcaca 
agcccaggtc 
tgaccaaagc 
tagcccaggc 
atgccaggtg 
ttagttaaga 
nnnnnnnnnn 
nnnnnnnnnn 
gggaggtgta 

gggggggggg 

tagtactatg 

tgaggtaaag 

taccttctgg 

aggtgaaagt 

tctggaaccc 

aggcctctga 

caggagcggg 

tcaatgagca 

aagataatac 

tgccggaggg 

tgaaacctag 

ccttcacggc 

ttcgggaccg 

gagtaggggc 

tggaaaagag 

gtggatgggt 

gctctttgtc 

tcacccaccg 

tgaatcggga 

gagcagatgt 

agatatctag 
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gctgtgcatc 

cgctgcagat 

ctgtcaccat 

aactcaaggg 

ctgaaaacct 

ctcctgggca 

aaaaaaaaaa 

gtaagagaac 

caggattgga 

aataaatact 

catgcgtggg 

ctgaaccttg 

cactatgtgg 

ctacggaagc 

tgggccagcc 

tggttgtccc 

ctggtcaccg 

ttgcagtctc 

gttgccagga 

ccccagggag 

caaggactag 

gagagagaga 

aactttaaaa 

atctctttta 

aacagtttct 

caatctccaa 

cttgggtcac 

tacctgtact 

aaacgctgca 

gtggcacatg 

ggccagcctg 

agagaaagaa 

gttcaatgcc 

tggctgactg 

aggagcttcg 

atgatttcat 

ttatcctggt 

ttaaggtttc 

tctctggtgt 

cggggggtgg 

aaggtcctga 

aatctgatgc 

taaataaata 

atatctctgg 

gcaccaggcc 

gatcagaaaa 

caggcctgat 

gctggcctgg 

tcacaggtgt 

tggtactgag 

catgggctct 

caaacccagg 

ctggtgataa 

gtatgtgtgc 

cccctttaat 

ctggtctaca 

taaccaaaaa 

ggtctgcaga 

acattaaagg 

tgttttgttt 

ggagactctg 

ccacaaagaa 

gacacacact 

tgtggtgcat 


agctctgagc 

ggagttgata 

ttggcggtaa 

cccaacctct 

atcctctcag 

ctgcccaagg 

aaagccagga 

ctctggtcca 

ggcagtccta 

aatttttaaa 

tgtccctagg 

aaccttgcct 

gcttctctat 

tactgctgct 

tggccaccct 

ggcagcacta 

tgggcagcat 

agccctaccc 

acacttccac 

gtggggctgg 

actgtatcat 

gaacattgta 

gtcccacagt 

aaattcaaag 

tccttcaaga 

ccgtccaatg 

ttctccagcc 

ccactgctgc 

tgaccccttc 

ctgataatgc 

ggctacaaga 

aaaactggac 

aagttggcag 

aacaccttac 

ctgaggagag 

cagtctcttg 

ctgcagtgtg 

tctggctctt 

gtgtgtgtgt 

agggggctgg 

gttcaaatcc 

cctcttctgg 

ttaaaaaaaa 

atccccctgc 

aggctagaac 

cagacccagg 

aacctgcggt 

aattcacaga 

gtgccaccac 

gatagaatag 

ccttccccaa 

tcttgtgtga 

cttatagcct 

ccacacaaac 

cccagcactt 

aagtgagttc 

aaaaccactt 

ccaggctggc 

atgtcccacc 

tgttttttct 

aagtcagggc 

accctctctc 

tgtaatctaa 

tcaagagcag 


gaggagcctc 

tgctgtggaa 

gactctgaag 

ctcttcagtg 

acttggattg 

agtttggttc 

atgaaagcta 

gggctgctca 

aaaaatgctt 

aagctgatac 

ataaggagct 

gcaggtagtg 

ggtatccctg 

ctcccataag 

ggtcctcttc 

ccagctgtct 

aagcatctcc 

gtttatcctc 

tctcagtctg 

gaagaggaga 

cagagagaga 

tgagatctcc 

ctttacatat 

tctttttaca 

gggaaaatat 

tctgggatac 

aggccctttg 

tgctgctctt 

agtcctgggc 

tagcacttgg 

gactctgttt 

atgacagccg 

tgcctagttt 

ccagcaactt 

aacattgaac 

tgaatgtgat 

gtgtggagtt 

ttctttcctc 

gtgtgtgtgt 

agagatggct 

caacaaccac 

agtgtctgaa 

aagaagttgg 

ctctacctct 

attctctgat 

gggccacaga 

tttttgtttg 

gatccacctg 

acttggccta 

actctcacca 

aggaaaaata 

tggctcagca 

tgttatgctc 

acacaatcaa 

gggaggcaga 

caggacagcc 

taaatattat 

cttgaactca 

actgcctggc 

aaaaaatttt 

cagccaggtc 

aacaaacaaa 

gcccttgaga 

cctgggtggg 


atagtcttct 

ccagacattg 

ggagaggcca 

ttctgtcctc 

agttggaggg 

tccccacaaa 

gggtatgcta 

tgccccctag 

gggtaatata 

ctggaaggat 

gggacttgtt 

agctgcctca 

cttctggagc 

gccccatcct 

cgccttctgc 

cttgctctgg 

acagggatcc 

atgcacaagg 

aaaggtgtgg 

tggtagccca 

gagagagaga 

attacagtca 

tcttaaaatt 

attaaaagtc 

cagggcacag 

aactcacgat 

tagcacacgc- 

ggtggt c atc 

cttcaagaga 

gaatgacaag 

caacaaacaa 

gaacattatc 

gataagggtc 

gctgatagac 

cctattgtat 

tgctttgagt 

gtggaagctt 

ccggtttttg 

gtgtgtgtgt 

cagcggttaa 

atggtggctc 

aacagctaca 

catggatgat 

tagacactgt 

ctacaaatac 

aatgtcttag 

ttctgggggg 

cctctgcctc 

actgcctgag 

gctcttctga 

aatgtaagaa 

tcaaagctac 

tcctttgacc 

gaaataaatg 

ggcaggtgga 

agagctacac 

ttttattttg 

cagagatcca 

taaagattta 

tttaaaaaga 

tactgagtga 

caaaagagaa 

ggctgagaag 

ctacagaaaa 


ccggtgcctt 

gcccaggcct 

ggtagtaagg 

tgacttttcc 

aggttttgac 

cctccagctg 

tgcaaatagt 

ataagggtca 

aagtgaataa 

gaggcagaga 

gggcacaggt 

gcaccgctgt 

tgaactccat 

tggccttcag 

ctctgggatg 

ttctgctttg 

gaattctgac 

aaaccaagac 

aagttttctc 

ctgcatagtc 

gagagagaga 

ggaaatcagg 

tcaatctctt 

tcaactgtgg 

tcacaatcaa 

cttctgggct 

gtcatcctct 

tcatggtact 

gaagactaga 

cagaaggatc 

acaaacaaac 

tgacattcat 

tagccactct 

tctgccttcc 

atgaataaat 

catttttctt 

tgagttggga 

ttttgtttgc 

gtgtgtgtgt 

gagcgccaac 

acaaccatcc 

gtgtgcttac 

gtagtgaaga 

gagtatggaa 

ctagagtatt 

taggtaaaaa 

cgggagaggc 

ctgagtgtca 

tttgagcatc 

cttccacatg 

ttaaaaaaaa 

ctcccgccac 

tccacgggca 

cagccaggcg 

attctgagtt 

agagaaaccc 

ttttgtttat 

actgcttctg 

ttttttcttt 

accatccctc 

gctctagggc 

cagacccaac 

ttcaaggcta 

agaaagaggg 


caggttattt 

gggaccttct 

gagcatgtcc 

acaaagcccc 

tggctagcca 

atcataaaaa 

gtggcttggg 

gcagaaaggt 

ataaaaaata 

gtagaaaaaa 

catgcaaagc 

tgtgtctggc 

ctgtttgcat 

agtaagcagt 

gatgagtctg 

tgtggctggg 

caaggatatc 

acgtgagcct 

ttctgtcagc 

tactatgtag 

gagagagaga 

agatctaaat 

taaaatatcc 

gctccactaa 

aagcaaaagt 

cctccaaggg 

aggctccaga 

ggcatctcca 

gcctggcaaa 

agaagttcaa 

aaaccaaaga 

aaggtcctga 

ggtaatacca 

agcaaaaggg 

tgctgtgcaa 

ggctccagtg 

gggtttcctg 

ctggtggggt 

tagaagttgg 

tgctcttcca 

gtaacaaaaa 

atataataaa 

ctggcattag 

gtgtaccacc 

attcctctat 

cacttgcttt 

tggctggctg 

ggtaccagga 

agtactcaca 

tgccctgcag 

aaaaaaaaag 

agctgaccac 

tgctgtacac 

aggtggcaca 

cgaggccaac 

tgtttcgaaa 

cctggaactt 

cttcccaagc 

ctttttgttt 

ctagcactca 

agccagggct 

cagacctgag 

gccacaagtg 

agagagagaa 
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tggttaatga 

aagacgagtt 

agttcaatcg 

tcagccaaac 

agggcraatgt 

agatcctcac 

tatctacaga 

gtgagagtcc 

taatatctga 

gaatggggta 

agaactaaga 

tatgttcacg 

agaggccaga 

cagtccaatg 

taactgtgga 

ggcccatctg 

tggctgggag 

actcacctct 

ttcccctaat 

ggtcgctggg 

tgcagcagaa 

gtttctaaat 

gctatttgga 

taaattttaa 

tcacataatg 

agccatttgg 

attttttatg 

catcaattag 

catccctgtc 

cacatacatc 

nnnnnnnnnn 

tgcctggctc 

tttggtttga 

ctcacagaga 

acctggcccc 

tatctatcta 

ggtagtcaat 

tgttaggctt 

atcttccatg 

accaaccaag 

ccaggaccaa 

tactgctctt 

acaggtcttt 

ttgaactcac 

taccacacct 

tcacaggatt 

agcacaaaga 

aatcaagcct 

ccggaaacat 

caggagccct 

tgtgctcagc 

gagtctgaga 

tagatggggt 

cagtagacct 

tccccggggt 

acacatcatc 

gaaatggctg 

ctctgagtct 

ccaaagaagg 

ggaccctgtg 

gttgggcaga 

gtaacaggaa 

cagctaggta 

gctccgtgca 


agatgactct 

gagggtggag 

gttcaggtac 

caacgcagcg 

cactttagct 

acaccttctg 

gataagggta 

ctggaaaggg 

cattcctgga 

taaaccagga 

ttagaggcat 

aatgttagcc 

agaaggtgtt 

tgtgtgctgg 

ggcatatctc 

cgcatgcgct 

acttgcagat 

tctctgtagc 

cctaaatctt 

gctctagaga 

cttttttgct 

ataaccccat 

ataattaaaa 

atgttattat 

aaattaacca 

agtggaggcc 

cacctatcta 

ctttgatcct 

ctcttttcta 

actgtccann 

nnnnnnnnnn 

tgttttgctg 

ttttgttttt 

tctcttgtct 

tttcatctat 

tctaaaattt 

aagtaaccct 

ggtagcaagc 

ggttctgggg 

tcattttccc 

atctagagga 

ctcccttccc 

tctctgcatc 

agctgtcctc 

ggctcatttt 

cacaaaaact 

gttgttgaaa 

cacaccctga 

tatcctctga 

tggagggaag 

cattttattt 

ggaagtactc 

cacttagtgg 

gagtctttat 

cctgatacca 

caactccaag 

aggaaaacag 

gaagagggtc 

aacacggcag 

cgaggctgga 

g a ggggctac 

aaatttgggg 

gagtacacag 

gagtaaaacc 


ggaaaagtga 

aaccgccaaa 

attacttcca 

actattgact 

ttcctttgca 

ccagatccag 

caaggcgtgt 

ctccctgccc 

aatgtatgta 

tggcttcaca 

gcactaccat 

tgcatgtatg 

ggttggattt 

agatgaaact 

cagcctcaga 

ggagacctcc 

cccaagtggt 

cattggctga 

tgccctctcc 

gatggggggg 

gtatattgag 

ctgtttcaac 

aatttcaata 

ttttaaaaag 

tacgcaagtg 

atggcgcttc 

ctactgataa 

ctgtcataac 

catcttatac 

nnnnnnnnnn 

nnnnnnnnnn 

ttgttgtttt 

ttttagagaa 

ctgccttcca 

ctatctatct 

atctgtgtgt 

cagaagttgg 

atgtttaccc 

acacaaatgt 

agccccatcc 

gtgtccatgc 

ttccttcatt 

ccaagctagc 

ctacttcagc 

tcaaataaat 

ttggaaggag 

gaagagagaa 

gttcaattcc 

cctccagacc 

tcctagaaat 

ttcctttgtg 

actgtgtccc 

tagccttggt 

atctgcaaag 

ggttgtatta 

tggctttctc 

agttgcagaa 

agcgactctg 

agggagagcc 

ggggaggaag 

attcatctgc 

ttgagggggg 

taactcccag 

cacacagggt 


aactcaagag 

gcggacgagc 

aaacgccatt 

tctgcatttc 

gaaaggaaaa 

acactggtat 

gtgtttaaag 

tcaccatcac 

tgtattcatt 

tctcagagac 

acttggctca 

tgtgtgcacc 

cctggagatg 

tggttcatcc 

tttcccagtt 

tttaccgcct 

acaagagaag 

gcatactttg 

aaatgtgctc 

gggggagczaa 

tcttaaaaat 

acaaaatgca 

cttgatttaa 

ttacaagttt 

aggctcggtg 

acattgaatt 

caaaacaccc 

tcgtgtccac 

tgaatccaca 

nnnnnnnnnn 

nnnnnnnnnn 

tgtttgtgtg 

tcttactatg 

agtgctgaga 

atctatctat 

gtctgtgtgt 

ctttctctaa 

actgagccat 

gtactttgat 

tgactcccat 

ccaacaaaca 

tcttcgttcc 

cttgaacttg 

ttcccaaaca 

aaaaagaaaa 

actaaaaatg 

gaggaaaagt 

tgaccctatg 

cgctgtggca 

gaatcttact 

tacccgacac 

cagtgagctt 

gtggagaaag 

ggtgttgggg 

agtgtatgtg 

atttttcctt 

agacagggcc 

agaaatgaag 

cccgaggaga 

aggggggagg 

agtatggtgt 

cagcctgtag 

aattctctgg 

ggatttcagt 


aaagcccctc 

cagacagaga 

gccacatcag 

agagacttcc 

gtccctgcct 

gactcagcct 

tatgtgttta 

cgaaagcaca 

atgtagccct 

ccatttgcct 

tgatttactt 

atgtgcatgc 

aagtcccaaa 

acaagagcag 

aatgtttgct 

tgagcctcat 

aataaactgg 

cctcaaccta 

ctcccccgca 

c gggtacagc 

tcatataaac 

acaacaaaat 

aaatgcttta 

aaaaaaaaga 

cactggtaca 

ttatactttc 

atgagagagt 

tccctgcctt 

cactcagttg 

nnnnnnnnnn 

nnnnnnnnnn 

tgtgtttaga 

tagctcaggc 

ttaaaggtat 

ctatctatct 

acattcccca 

tccttggatc 

ttatgacccc 

gtttacagga 

taagtgttct 

ctctgccaag 

ttcttttctt 

tgatgtagct 

ctgggattat 

tcaaaaagtt 

gatttttaaa 

tagcttagta 

gtagaaggag 

cgtgcatgca 

gaagcaggtc 

gcttccattc 

ctgtcttacc 

agaacacagg 

cataatcaaa 

catggtcctt 

gccagtagcc 

atggcctggc 

ctatttctga 

tggagacaga 

agtgagaccc 

agaggggaca 

ggctgggccc 

ctccactaaa 

ctcctttgca 


agatttgctt 

ctgccaacaa 

gatgcttcaa 
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ccccccctcc 

gtaaaagagg 

gctccgtgac 

agatgatggg 

acaccacgga 

agctgctgga 

tcccaataaa 

gatctgccac 

aaagcctcat 

cctgatgata 
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atggggctgg 
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tgtgtctctg 
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gagggagtca 

tccagacctt 
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tttcttcttc 

catgtctcgc 

gcaggggctc 

tggaaagcac 

catgccattg 

caccctcctg 

gacgggggcg 

tctcgtagag 

agatctcctt 

ctcctctggg 

ctagaagccc 

gggaggagga 

tcacaaagag 

gatgtttatg 

gatttatata 

gaacacagac 

aggaagcaga 

caatggccca 

atccgatgag 

actaaagtgt 

aagtggttgt 

ccagcctgga 

gggagggcaa 

tagtgagtta 

tcccaacaac 

gtgtttaaag 

ggccagagca 

atctgtacaa 
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acagagatga 

ggggggctac 
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agtcccagca 

agctgtatgc 

agacagacag 

ttgctattca 

ccaactccac 

tgttgtgcac 

agaaaagaaa 

ccaggtcctt 

ctttcccaac 

ccttcctggc 


acccccagct 

ctggtgagga 

actttaaatg 

tcagggacct 

gacccctgtg 

gcgcaagcct 

cccctttccg 

cttatcacac 

tctctgctta 

acaacccttg 

agaaatgtac 

ctggggcttt 

gttgcacaag 

tgtgtgtaaa 

cacaattgtc 

ttcctctgtg 

gggttgtctg 

agatttattt 
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gtgcttagga 

ggccacagca 
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tgtccccata 

gcagcctcaa 

tcttacgtta 

aaggaaaagg 

ccctcttttc 

tattttttta 

ggaagtgcag 

ttgtaaaggg 
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ggcggaactc 

ccagagtgtg 

gcactggcct 

tattctagta 

ctgcatagca 

catttggaat 

gttcagcggt 

cacatggtgg 

tcagctacaa 

agtagaggtc 

ttacagtgca 
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gcttcatgtg 

agtgagagta 
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agagaaacac 
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agagctgttc 
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ggctggcttc 
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tctgcatgtg 

gcacatcatt 

gcagaatgac 

acgcctctct 

ctttctcttt 

ggtttgtcct 

gctagaacac 

atatgtaagt 

ggatggttgt 

gtcgggtgct 

actgctctcc 

aacatcttct 

tcccaggaaa 

ttcgggaacc 

gggcaaagga 

gcacattttg 

tgccacacag 

actaacggct 

gcgcaggctg 

cactgaaatg 

cctcagcccc 

gcccagcacc 

ggggatgaca 

gaggggagag 

tggctttttg 

gaacccgtat 

tgagttaagg 

ttttgtaaca 
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ctgcttgaaa 

gccatagctg 

ttccttgaca 

cttaggaggc 

acacccagtt 

gtaaataaat 

taagagcgct 

ctcacaacca 

tgtacttaca 

ctgagtttaa 

ctcatataca 

ggagggaaga 

gaaacacagg 

tcatgagttc 

aaaaaaaaaa 

tgaggcaggg 

ctatcccctc 

aaagaaaggg 

cctgagctca 

tatgatctct 

attagaaaaa 

ttccacgtca 

tgggggctga 

gcaaactcaa 

atgcgggggc 


ccagtcagag 

tctgtaccct 

aacagtgaga 

gaggctcaga 

gcagagggag 

gctagttatg 

gtggacacgc 

atcccttgag 

cacctatatg 

gaggggaact 

gggagcctgc 

gggactcaga 

aggatgttat 

ctggagccct 
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gctagccccc 

agacatcatt 

acactgtagc 

gagccaccat 

cttactcact 

agaatggtgg 
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gcagggggta 
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acgcttcttt 
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ggaggtagag 

agggaaagag 
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ttattaacag 

ggggcaatta 

tccaatcaaa 

tgtcccatta 

gaaggtgaga 

ggtcatgagg 

aaggatgcac 

tgaggcagga 

tcaaaataac 

aaaacatttt 

gactgctctt 

tccataagga 

tataataata 

ttcccagcaa 

taaaataaat 

agaagaaagg 

cctgtagtcc 

aaggtcaact 

acatagccag 

cagaaagaaa 

cccccccccc 

gccttcagat 

aagcctggga 

atatgaatgc 

gaggaggaag 

gtgtgagagg 

actgaggcag 

ctctacagcc 

attggctccc 


ttgggggtgg 

ccactagcaa 

gactctcctg 

caccaggata 

aatgccaatg 

agtctactgc 

cttgctcaag 

gatggttacc 

gcagatccct 

cgggtttctg 

cttgctcacc 

gatgttgact 

gcctggtgtg 

cctgtctgtc 

ctgtatgggt 

tgtcactgag 

gtcttttttt 

tgtcttcaga 

gtggttgctg 

gagccatctc 

gcaggaggat 

gtcttgtgca 

agagattgaa 

agcagcagaa 

gcatggagcc 

gtcaaattct 

cacaaaggca 

atgagagctg 

ttggaggagg 

tagccaccac 

ctttttttct 

agctgtaaac 

atcaggaagg 

atcgagagag 

gagttattta 

cctgaaagga 

agagagcaga 

ggtgcttcag 

gaagctgaca 

agggagggac 

cccagggttg 

ctatagctag 

ggatcaccat 

aacaaaagga 

ttttaaaaaa 

ccgaaggttc 

gatctacgcc 

aataaattct 

ccacatgatg 

aaataaatct 

taagaagcta 

tggcactcag 

tgggtgagac 

gcatgatggt 

aggaattcaa 

ccagagacag 

ggctcagcaa 

cccaaggtag 

tggggcatgt 

aaaaacataa 

agggtcrtggc 

cggaggaggc 

tgtcctgatc 

aggctcctgg 


ggcagatctt 

agtgccagac 

gtggcaggca 

aagaataaaa 

tggcccagct 

tgctgccttg 

ccgagtccct 
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tgagtgtgtg 
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aaagtcttct 
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gagtttaata 

accagggctg 
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cagccttctc 
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tcccaacact 

tctttttctt 

caccctggtc 

agggagggag 
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gagagacgga 

aagagatacg 

gtattttcca 

ctggatggag 

acagaccagg 

gaaggacccc 

gcgtggtggc 

gagtgtatgc 

agtgggggtg 

agaaaggggc 

tgagttcaaa 

ctcttctggt 

ggagtgaggg 

gctcacaacc 
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gtggggttgg 

cttgtctcaa 

atacatttat 

gatcaggctg 

acagacagac. 
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acacctgtgg 

ttggctcttt 

gggttgcccg 

cgcaaatggg 

agcttggaag 

actccagttt 

gaaacccgca 

aggctggggt 

agaggcagag 

accccagcaa 

tactgtgtgc 

tttcttctga 

gactctccaa 

cagttctaca 

tcattttcct 

tgccaggggc 

aagactgcct 

agtgctgagc 

ctagcatgtg 

ggttcatgct 

ccatcctcag 

acaaacaaca 

ccatctcaaa 

ctatcaagtc 

gatggctcag 

gcacccacat 

acacagatat 

aaattggaag 

gaagggggag 

tttggggttc 

gtgggggcca 

tattactatt 

ggatgtctgt 

tctggtagtc 

aaccttattt 

gaccctcctg 

tntccggatt 

ccgctggncc 

nnnnnnnnnn 

nnnnnnnnnn 

agctctaact 

caccagcaag 

tggaaacgga 

aggactctgc 

tgctttctga 

gcaatgaaac 

ttatttcctt 

aaaagaaatt 

tctctctctc 

atgaatgcac 

gcaaatggtg 

aatccctttg 

tcctcctcct 

tacccttaag 

tttgatgaag 

agaagggttt 

gcttttggct 

ggatgcccag 

taaaagcagg 

tgaccacagc 

taatcgcttc 

ttattccagg 

gtgccggcag 

ctggggggaa 


agtgctaggt 
ggtaatagac 
tggctccctg 
agagcatgga 
cagcaggtgt 
tttggcacaa 
gttgaggaac 
gtgaagaaga 
tccctgagtg 
acaaaatccc 
cactaaccat 
gactgtgata 
cccc agctga 
ctggttgaga 
gtccccaaag 
caccccatgt 
ggtgggaggt 
ggaaaatggc 
caaggaaggg 
ggtattctca 
ctacatagct 
acaaagcagc 
aagcaagtag 
atgtatatat 
cagttgagag 
ggcagctcac 
ccacgcacat 
taaaactctc 
ggcataggtg 
attatgcaat 
aaggtctcct 
agtgtgtggt 
ggtctgagga 
aggtttgcaa 
tttgcattga 
ttcctgcctc 
gctgcncacc 
ctcccccccc 
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gctgttaaat 
gcccaggtcc 
cggaaagggg 
tccttccccc 
ccgcttagcg 
actaatgcag 
tatgtagaaa 
tctcttatga 
tctctctctc 
acaaagtagc 
gtgactggga 
cattcaaacc 
tacctgtctt 
gacaattata 
tcgatattaa 
gggttcgatg 
agggtagaaa 
agaaggtgga 
gatgagctgg 
tcctgccctc 
agaaagccag 
caaggactga 
ggcgggacct 
aggctaccgt 


gatttgctaa 
atgcctagca 
cagctggggt 
acagaggctg 
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gggctccctg 
atccataatt 
cagcctaacc 
gtggcgggga 
gccccttgtt 
acactcacgg 
cctagacaat 
ctccactagg 
gagataagga 
agaatgggct 
cctgggttcc 
acattcagga 
agcttgagac 
agaaggccaa 
catgtatctt 
gcatgtatgt 
cacttggatg 
aactgtctgt 
aaacaccaat 
taaggagaca 
tgtggtgtgt 
gtatacatgt 
cctcaatccc 
gtgatgtgtg 
tgctctctta 
agctagtgtc 
acatggaact 
ccccntgtcn 
tccccgccnc 
cccccccccc 
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tcacactcct 
agcttgactt 
catggaagca 
catgcctgcc 
gaagacaagg 
tgggcttttc 
gtgcatcttt 
actaatagca 
tgtctcctgt 
cccccccc at 
agtctggcca 
cgggctgctt 
cccatccttt 
ctcttccatc 
agagatgctt 
tcaaagtttt 
agagatggag 
aggatggggg 
gcgctaggag 
ccctccccca 
attggaatgt 
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agctgtgcag 
tttatctctt 
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gtctggccag 

tggatgctaa 

gcctgtggcc 
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ggtgccatgg 
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ggtctcccct 

tgaggtttat 

ctcccccact 

cgggttggga 

ctgcagctca 

ctctctcagc 

ggctggtcca 

ctggcaatgt 

aagggatggt 

gggaggtaac 

atccccagca 

ggtacagtca 

cagcctgggc 

ctggcaagag 

ctttcatttt 

ttgtatgata 

ctctttcaaa 

aattccagtc 

gcacataaaa- 

aaagggactg 

ttaacatgca 

gtgggtgcaa 

tctctgcctt 

tgggtgtgtt 

ccatgttcgt 

tttatctgcc 

tcctgagttg 

nnntcacang 

ccnagcctcc 
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nnnncagaca 

tcacatcccc 

agagcaaagc 

gaaccagaca 

gtgcaactgg 

ggagcctgtg 

attaagccag 

atatgttgtt 

ctgatacata 

gcatgtgtgt 

attatttctt 

tgcaattctt 

gctgtggcca 

gcagacaatt 

agcaaacacg 

tatttacata 

aaaatcttaa 

atacttattc 

agggagggtc 

tggaaccagt 

gtctctctgt 

gttgctcacc 

tcatggcatc 

aaacagctcc 

ggcagatggc 


catttgccca 
gcttggaggg 
ctgaagaagg 
gcaatatggg 
caggcagatt 
aagaggcaag 
gagcccatgt 
catcccctcg 
tacctaacca 
agagacctca 
gtcttctcta 
gtgtgtgtga 
gtaacacagt 
tccttagaga 
tttgtaagct 
tctctgtcac 
ctcccctggg 
tctgttgcta 
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ggaagagcag 
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gagtattacg 
tttttacatt 
tgaaagtagg 
gaacctgggt 
tcaggggatc 
taaataattt 
aggggaagtg 
gaatacactt 
gatgtaagct 
attttcattt 
aagccctcac 
gtgggttctg 
gagccacctt 
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ggtgttcttt 
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ccccagccgg 
ctgctgattt 
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agatgtgata 
tgcctaggcc 
gtgtgtgtgt 
tccagactca 
ttgccagccc 
ctggagtcct 
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ccttgagtct 
gctccctttt 
ggtgtagagg 
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gccttacctg 
gttctcgaag 
cccccgctgt 
tcggaaattt 
gtagagtggg 
gaaagaggtg 
ctggcttccc 
tattaggacc 
cctggcttct 
ccctaaaggt 
ggacgctcct 
tactttcttc 
ctgagtgaga 
aacaggaaag 
ttacctttcc 
gcttcaggct 
aacaagggga 
agccagacat 
atctgccatg 
atgctggcct 
gctccttgga 
tcttagctgc 
agaccctcca 
tccacaattg 
gcaaacccca 
tgagccctgc 
gatccatgct 
ctcatcctgg 
gattaaaatc 
acaaagccct 
ttctaagtgg 
cccccccccc 
ataattctcc 
gaagcacgct 
ctctcaagtc 
atttacactt 
ctgaggactc 
ccctggcctc 
ggtacaccgg 
aaaggcttcc 
aaggtaaatg 
ccttgttacc 
ctgctgagct 
gtctccccac 
ttcatttgca 
tataagaaaa 
ggccaaacct 
ggcccggcac 
gatgaatgaa 
tgtgtgaggg 
tcctggggcg 
tgtctgtaac 
acgtggtcca 
gcctctgaag 
aaaacaccca 
caaagtgttt 
ctcagagtga 
agattcacgg 
octet ct etc 
ccatgtagga 
tgaacgagtg 
aaggcaagag 
ggttctatct 
cccaaccaga 


ttactgcctg 
ttagaaggaa 
gtttctaaag 
ctttttaatc 
agttccgtgg 
ttgtgggcct 
actctcaaga 
agagggactg 
ccaggagccc 
atggggatga 
gtaccccaga 
tgttgttgtt 
aaaaagagga 
ggcaggcaac 
tgacccccag 
aagtaaacag 
ggaggcagtg 
ttcaggcacc 
cctgccatct 
ggaggaaatt 
aacccacgtg 
ctcaagcact 
gggagteatt 
aggagaegee 
gccactgagg 
ctcccagggg 
tttatgacat 
ggaccaggga 
attttttttt 
cctgtgctga 
agtgtgatgt 
aaacttactg 
ccatctccct 
gaagaggcca 
tacacggagc 
catgccgcag 
catcaagcac 
cttacagatc 
gggctgcagc 
taggaaatca 
gatgttattt 
atgeaggget 
gagggagggt 
ccccg ccccc 
gcagcttaca 
agaggaatga 
ggeegagaga 
aacagggccc 
tggatcatct 
agatgtcaac 
tcagcttcag 
ccgcagaggg 
gacccctctt 
gtatgtcacc 
aacgaaagaa 
ttcagcatga 
gatgeggagg 
ctggcagaaa 
ttttcaaggt 
cctggggcct 
ataggaaccc 
caggaggcat 
gggaggeagg 
gagaattcaa 


cacttgactt 

aaaaaaaaag 

agctgttttc 

atageaggag 

ccacagagag 

gtgcacacat 

tgaggtgtgt 

tgtgtgtgga 

acatgccaac 

gggaccaagt 

ctcagccgcc 

tttggaagtg 

gagggaggaa 

aagacaatga 

ggcttagcca 

ggaagagttg 

gccaggcagc 

tctccttccc 

gagagaggee 

ggtctttagg 

acagagctcg 

gtaaggttta 

gcctaccctt 

agacctggca 

aaettgeata 

acaactgggc 

agaaaggcct 

aggegageat 

tctgctccat 

aattagaccc 
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ctgttagagt 

gggtcagtgc 

tcagcagcca 

gagactgatc 

ggaatttgaa 

aggcaaagtc 

gaaccaccct 

ccatgctgta 

cttggaactt 

cagatgtacc 

cagccaccct 

tcctgagagc 

aaccctcctt 

atttctgaag 

atgccgaaga 

agggg cac aa 

gaaggtactg 

acccagcttt 

ccctcacctg 

aaagctgcca 

gcagcagctt 

tggggtgatg 

gcccagtgaa 

ctccccaaga 

tgtttactga 

cacacacaca 

cacacacttt 

actccgttct 

tcgagacagg 

tggcctcgaa 

tgccaccacc 

tgggtttagg 

ccatagataa 

tcttcttctt 

caccccctga 

gtcccaggca 

ccaaggcttt 

tgccatttat 

gcaagaaatt 

cctgattcat 

tcctaacaga 

ggagttctag 

ctaatccagc 

cgggctgggg 

cgcttgcttt 

atggctctgc 

caacagaggc 

acattggtca 

gcctcagtga 

tgaagggcct 

aggag c atga 

gtaacactgt 

aacatgcacc 

ggtctgcaat 

ctgctataca 

caaaataaaa 

tccctgatca 

tggcagtacc 

gtatcctaaa 

tggaacgcag 

tagggcttgc 

gtcttacaga 

gtatgagttt 

accatccaat 


aggctagaag 
cctgggccca 
gtgaaccctg 
acctaggaga 
agttgcccac 
tccttggccc 
ctcgggctcc 
gggccccggc 
ggaacaaagt 
cggaatgtgc 
acccatacct 
ccctgctgac 
gctgctgaat 
tacatttttt 
tgctgggctc 
agcaagtgca 
aaagtcagct 
cctgtttttt 
gctgagccg c 
gttgaagaac 
tgcctatcat 
gctttatggg 
agatttaggt 
ttagttgatg 
gacttccttg 
cacacacaca 
cctttccatg 
aagtagtcat 
gtttctctgt 
ctcagaaatc 
gcccggctgt 
ggtacttacc 
gtaggcagga 
ctcgtggatg 
ctactcaccc 
agctcgtggg 
ccctccacct 
tttctgcctt 
gtcaggggag 
tctcacgtat 
agatgggttc 
cctgttcccc 
cccactgcag 
aggctgagtc 
ctttcctcct 
tgtcctcatc 
aggcacaaga 
gagaggcacg 
aagacacgtg 
ccaggaagta 
gccacctctg 
gggaaaacat 
aagccctggt 
cccggcactc 
atgagttctg 
taaagcattg 
taatacaaca 
aaccattgtg 
ataaggattt 
ctgtttgttt 
ctaagtaaat 
agaaaaattg 
gtgactgttt 
tgagtggaaa 


ctgtcatggt 

ccccgccccg 

tggcctgaac 

aggaagatcc 

cctagtgtaa 

ttctatctgg 

cagccctcta 

ttttactgct 

tcagacgtgg 

aagcggaatg 

ccctcccctc 

cttccaactc 

tccctaagaa 

tttttaactt 

cttagcaggt 

tttagaagcc 

agagccaggt 

tttttcctcc 

agtagttctt 

tgttgccctc 

ccggaaggtg 

agggaaaccc 

taaaggcact 

tctgtgtggc 

taggcctgcc 

cacacacaca 

aggtccaaaa 

ttgtgtattt 

atagccctgg 

tgcctgcctc 

atttacattt 

cttacacctg 

gggcattaaa 

gaaacgaaac 

agctcagggg 

aggctagtcc 

aaagcaaaac 

tgctttccag 

atgggttgta 

gggcttggtc 

agggggtaca 

atttgtgaga 

tctaagctga 

ggctcacaca 

cccttccctg 

caacatggag 

cagtggagga 

tagaagcctg 

cttccagcca 

tggtcccatc 

aaaggaaata 

atctatgact 

tctgtcttct 

tggaggtggt 

agccagcctg 

gttagtaatt 

agcaaatgaa 

agagatgcct 

ggttataaaa 

ggaagtccat 

tatattcatc 

agtggaaata 

ttaaaatatg 

tataaatact 


gccagagagt 

cagccaaggg 

agagctatcc 

gacaaagttt 

tctttccact 

cagtggccat 

gcctgccacg 

gattcatgcg 

aggggccggc 

cctggcatct 

ctgcctttgg 

tagtgcccct 

caagtcattt 

gggacttggg 

tgccaagagt 

tcttgcttct 

ttggatggcc 

ccttcctttt 

cagtggcaag 

tgcccctggc 

acagaactgg 

tggtcctctg 

gtctataaat 

agtgggaaga 

tttcttatct 

catacacaca 

gtaaatgtac- 

actttttttg 

ctgtcctgga 

tgcctcccaa 

ctttatttat 

tggatttttc 

agtccaccag 

agctcttcac 

gattaggatg 

tctactggct 

tgtagctctt 

gatagtgaga 

tgatatgagt 

tctgattgtg 

ggaggctgtt 

aactgaaagt 

gggataggat 

ttgcgacaaa 

gccacagtgt 

cctcagaggt 

cctggcctgg 

gagaacacca 

tctcctctca 

tctaccctgc 

cacagcaaat 

ggggttgtag 

gcattgcata 

ggcaaaggag 

ggctatatga 

caaagaaagc 

agccagaaga 

ttggacctgg 

tgttcatctc 

tccttttctg 

tgatggtggt 

taaatattga 

aacacatact 

gaatactaaa 


tgatggagca 

cacctgcttg 

tgggcagaga 

atacttccca 

ctctgaaaat 

gtccttggac 

cccccagccc 

ttggaactgt 

agacagcctg 

ctagtcctga 

tcagctgtcc 

cccatttcta 

gagttgatca 

ttctacaccc 

tgccagctcc 

tattcaagaa 

tctgggtcgc 

aggaacctgt 

ctttatgtcc 

ttcgtggagg 

ggtgggaagg 

gggagccctt 

tggggaatag 

aatagaagga 

tcatcatcac 

catacacaca 

tcaggaaggg 

tttatttgtt 

actcactttg 

gtgctgggat 

ttttagtctg 

cacctgtata 

tggtgactca 

atgaactgtt 

gaaggaaagg 

tctcaccatg 

ggttgggttc 

ctctgctcaa 

cccttctgct 

gttcaccttt 

tgttgtattt 

cataggggag 

gtgtaaggga 

gattgccctt 

gtccctccag 

gagaaagggc 

aaccacaagg 

ggaaagagag 

ggacctgcct 

agtttctata 

tcaaaaagag 

ctcagttggt 

aaactgaaca 

cctacattca 

gactgtctca 

agatgtggct 

aggctcctgt 

tagtttgctg 

agggttgtaa 

ctgtcatgaa 

gttctgtgca 

atactaaaaa 

tgtaatatat 

aagattatga 


gctggtcaga 

gcacaaactc 

gaagtggaca 

agaggctttt 

agaaatccca 

tgactgtgca 

cctccctgag 

gggggcgggg 

gaattcatac 

ggaagctgcc 

tccctcagac 

accctacaca 

cagagctcat 

tgccctttga 

tagtctgtaa 

cccctcatta 

tggccctgtc 

gcctcccaca 

tgacccagct 

aagaggagaa 

tctggacagc 

acccccactg 

gtgactccac 

aaagtctgtc 

catgccaaca 

cacacacaca 

ggacattgaa 

tgattgactt 

tagactaggc 

taaaggcgtg 

gcccagattt 

atggggaatc 

gagcctgggc 

gtccttcccc 

ctatggttaa 

catgggtggt 

tagcaaccac 

tactgtgcag 

gcctctagct 

ggcccagtct 

gacaggagga 

actagatcat 

ctgtagcaga 

ccctcgacct 

cactgggtac 

agcctggaag 

gcctatccgg 

cagccagcca 

tcctgggaga 

aacagcctca 

attcaaatgt 

aggtttgctt 

ggttggccca 

aggtaatcct 

aaaaataaaa 

gaaaccgttt 

gccttgtgtg 

tcttagaaat 

tagagaaaaa 

aatgtatagc 

gccatccaaa 

gattataaaa 

ttttttaaaa 

aaagtatgag 
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tttgtgactg 

aaagtggatt 

attttcccct 

ttaaataaaa 

tgcctgtgag 

gccgtctctt 

ctttgtgcac 

ctggagctgc 

ctagaagagc 

catgtttaga 

aagaaaatgt 

tgtgtgtgta 

ctccaccaca 

tacacacagg 

agaataagaa 

nnnnnnnnnn 

nnnnnnnnnn 

aacatagctt 

ttaattttta 

gttaatgcaa 

taaaagtcat 

atgcacacat 

taaataattt 

tgctgctttt 

ttagaaaagt 

cgtcagcaca 

caactcactt 

cacctgaagg 

aagccttagc 

tatttctcga 

caaaaaaatc 

ggttaggggc 

actctgacta 

ggttgtatag 

aaagatttat 

gatcccatta 

ttggaagagc 

taaaagttta 

tgtgtttgaa 

ttgattcttt 

cagctgagct 

agaaggaatt 

gtatgatatg 

cagatcacta 

taatcaaaat 

cctgtaacct 

aggacaacag 

gaacaaaacc 

actctgggac 

agcttccacc 

ctagcggaga 

gggtgcatga 

gaagccatgg 

gtaatgaatg 

gaaacttgga 

ttaaaagaaa 

aagcatcaac 

tcgggctggt 

ttcaaatccc 

ttctggagtg 

aagtattaaa 

ttttgttcct 

tgctaaatag 

tggcctaggc 


tttttaaaat 

caaaatgtta 

tgttttttct 

tacaaaagac 

tggcaggaat 

cactccctct 

cagagtgtgg 

acgctgtttg 

agcaaatgct 

ttttggagaa 

ttgcttggga 

tgtgtgtgtg 

ctaaagtcaa 

aagcattcct 

agaagggaaa 

nnnnnnnnnn 

nnnnnnnnnn 

aaagaattct 

atgggaaaaa 

acaggacaga 

ctatttatac 

acaaacagag 

taagctctaa 

gccattcttt 

aaggagcaca 

cagcaatgtc 

ggactggtac 

ctgcagcagg 

tgcagtggaa 

agtgaatgta 

caacttttgt 

acaagctccc 

ctacacctgg 

caaattttta 

ttttatgagt 

cagatggttg 

agtcagtgct 

aaattaattc 

gtatatatgg 

cctttcactg 

atcttgtggg 

catgtaattg 

gtaagaaaaa 

tgttcaagga 

gatcccatcc 

caagtccccg 

gatatctgag 

aacaacttca 

acagtgtcac 

tgggacacaa 

ggctgcaagg 

tgtgaaactt 

ccaacaaccc 

ctatgtctaa 

cagaaaagcc 

tttcacaaaa 

taagaggagt 

gagatggctc 

aacaaccaca 

tctgaaggca 

aaatatgcaa 

gttgtttgtt 

tggaagatga 

tactctgcct 


atgaatgcat 

agaatggttg 

gtctttctaa 

aaataaatgt 

ccaacattgt 

gtaattttta 

gtgcacattc 

gatcttctga 

cctagccact 

tttgcttgta 

tacctattgc 

tacactggag 

ttctatgaac 

ctgactttct 

tgctcaaaac 

nnnnnnnnnn 

nnnnnncccc 

agagaaaaaa 

gtaatatcaa 

ttatcttggc 

gagatgaaat 

tctttttatt 

aatttaaaaa 

ctgataaaag 

tcctatgagc 

ctggaccatt 

ggggtgaggc 

ccagttccgg 

cctatgcaaa 

caggcaccat 

gagacaggct 

aaggagtatc 

cttttaggtg 

ttcaccaagc 

acagtgtggc 

tgagccacca 

cttaaccgct 

tcattatgca 

ggatgtagtt 

tgggtcctcg 

ccaccaaact 

aaatagaaaa 

ccaagaaaaa 

tggtgtccca 

agacctgcta 

ggcagcacag 

aaggagtccc 

ccatgatgaa 

acactacagc 

tgtcacacac 

gtggagggca 

acaaagaacc 

cagcactctg 

agagatgata 

aaatgtaaca 

aggcacagaa 

caggaacagt 

agcggttaag 

tggtggctca 

gctacaagtg 

taattggaat 

ttccgagcct 

ccttgaactt 

gactcaggct 


acttgtaata 

tttttgtatg 

tttttaaata 

tttaaaattt 

cttctgaaag 

aaaaatatat 

tgcagaggcc 

tgtggatgct 

aaagccatct 

ggaggatggg 

ttcttgagtg 

ctaggaatca 

ttcattaatt 

gactgtcagc 

ctgtcactct 

nnnnnnnnnn 

tattttacca 

aaaaaagtcc 

agaaatccat 

caaattaaac 

gtgaattagc 

cttaagccta 

atagtgaaac 

tcccaaaggg 

tggtgacctg 

tgtgaggagc 

tggcgatggg- 

caggttttcc 

cacacaggaa 

ggcacgcatc 

cccttcattg 

tgctctccta 

ggttctgagg 

cagacaaccg 

tgtcttcaga 

tgtggttgct 

gagccatctc 

cgtgccacag 

atgtgacggc 

gagcgtcaat 

tatttttaaa 

gcagagttga 

acctctacaa 

gaaaagggtc 

cagagaagct 

cagagctgtg 

agtgagcgtg 

catcatgaag 

ttccacctgg 

tacagcttcc 

ggtacaaggg 

aataagttag 

gaggcaggca 

aagtcccatg 

aatgcatggt 

tgtaaggtga 

gtattttaac 

agcaccgact 

caaccatcca 

tacttacata 

tattctttgc 

gtttcatgta 

ctgattcccc 

cagtgaattt 


tattttttaa 

gtgggatagt 

ttgtgcattg 

ttactctttt 

cagctagcgc 

ttgtatgtta 

agaagagggc 

cagaatcgca 

ctccctctag 

ctacaccaag 

tgtgtgtgca 

tatccagggg 

gtctgaatcc 

cagctaagga 

ntggggnnnn 
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agcattgcag 

actaatgttc 

gtctacggca 

cacagtaaca 

gtgcgcgcgc 

ccaaataact 

cccttcaggc 

tgtcataatc 

agttttacac 

cccacgctgg 

caccactttg 

aaatggcttt 

aagggcagtc 

tggaggtcag 
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ggaagtgtaa 
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agagggagga 
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atgtgtggac 
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acagtcagat 

aacaggggtg 

gaaagagaga 

ggagggtctc 

agagttacaa 

aggtggggaa 

aagcgagcga 

tacctttata 
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taataagtaa 

aattctacta 

atccaaactg 

tgtctccacc 

tcaaacactg 
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acaattgtga 

ctttcatatg 

atgagtgttt 
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tgcttgtgtg 

ccttttcaag 

acctactctc 

ggtgtggctt 
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ttggttaaag 

gaggaggtga 
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aggtggggat 
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gacccctaac 

ctggggcttg 

ctgctgctgc 

gtgcaaaccc 

tggctcttag 

tttttgtctg 

gctttcagtt 

agtaaagtca 

tcatcaacag 

gaaaacctaa 

gtttacaaag 

tggtggaaaa 

gcagggagag 

aggaggggga 

tgtatttgcc 

aattgcctac 

tggctctgaa 

caaaggcagc 

acaggcagga 

cctgctgggc 

tttggcaaaa 

aatacattca 

acagagagaa 

agcccttcct 

cagtctatgt 

tgaaggaaag 

acctatcttc 

gcacctcacg 

tctccttcca 

cctgctgagc 

agcctcattc 

gactcaattt 

gacccgagtt 

gggagcagag 

gttcagtaag 

ccgctgcagt 

ttgaagctgg 

acagtaaaca 

tttaatccca 

tctacaaagt 

aaaaaaaaaa 

aaattttcat 

aagtctccaa 

agcagtcgct 

aagaggacca 

tgggatgcgt 

ggaacaagag 

catctttaac 

agttcatgta 

ctctaattat 

agtaaccttt 

gaattccaga 

agctgcccac 

aaaaccattc 

tgcaaatcaa 

tggtcccaag 

agctatcatt 

caaacccacc 

agcctgccat 

ctctgccagg 

ggtgaggccc 

caatatttct 

tatgaataat 

acagcacaga 


ttccatctct 

ctggctggtg 

tgctgctgct 

agtcaagaga 

aggagcagac 

cgcatagtaa 

ttaatgaggt 

ctgggctcaa 

gacggtgtgt 

aagcagttca 

ggttctaccc 

tatgagttga 

atgagaacag 

ccgagcacgc 

cttcaccctt 

cactatttct 

ggctgaagcc 

taaaaaaaaa 

agtctccctt 

ctcctgaggt 

ttactcttaa 

gcacaaagtt 

aaagtgagac 

ttacagctca 

gaagcatttt 

tagaaatgag 

cagcgaagtt 

aaggtgcttg 

ccagagtcct 

cctttcccca 

tcagaaccat 

ctataaaaag 

ttgttcccac 

gtaggcagat 

ggagcctgta 

ggcatctgct 

aggtcaaggc 

tatattttac 

gcactcggga 

gagttccagg 

aaaaaaaaaa 

tttttataaa 

cttgataatc 

aagtccgggt 

gaagttcaag 

gagacacacc 

ttaagtgcgt 

atctttggct 

cttcatcatc 

atagaacata 

atctcttgca 

gcattgtaaa 

aagacctcaa 

tacaaagcag 

gctatcccag 

ccatgaggga 

cttacagtct 

aacttcccaa 

ggtagtgcac 

agttccatgc 

tgtctcaact 

cagttcctac 

aagtatatta 

tacaggacac 
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gtgtcgctgc 

gctgctgccc 

tttcaacacc 

tgcaccacgc 

agactggttc 

tctccctgaa 

tctgtgatac 

tcgtggcagg 

attgtctctc 

tttcccccaa 

aagagtctca 

agttcagaag 

ttctaaagca 

tatacaaatg 

aaaataacga 

agcctcgttt 

aaaatgtccc 

gccactgagg 

cccaactcac 

gtagctcaag 

gagaagatta 

acctatttac 

gccttcccat 

taaaatcagc 

cacacagtga 

cttgccttct 

cagctatgtg 

agggatggca 

gcctctcctt 

ttctgctgcg 

ataactgagg 

atggggcgtg 

cccaggcctc 

accacagaag 

catagccaca 

cagcataggt 

tacaattaaa 

ggcagaggca 

acagccaggg 

aaaaaaccaa 

aattacagtt 

tgaggctggg 

agtcccttgc 

gtgctcctcc 

agtaacagca 

tactcccgta 

gagaagagaa 

atgcagttaa 

agatgtggta 

aatctggtac 

tagcagcaat 

atcaatcagt 

taacgtaggc 

gaagccagat 

actagattac 

attatggcaa 

gcaagcattt 

acctgtaatc 

cagcctggtc 

cccaaaccac 

tgacagaaaa 

tacagtacct 

acacacagct 


cttacccatt 

agaggctggg 

atatcctgga 

agcagtaaat 

tgcacaccct 

catttccctt 

gccttgccca 

taccactatt 

atccattaga 

aagtgctttg 

acaaagcaac 

ggtaccgtga 

cagccgtgtc 

gcaaataaga 

tttaccaagt 

gtttatgggg 

aggctgctca 

agagctcctg 

ctctcccttc 

cgggcatgct 

gaatgagacg 

aatagaattg 

ataggagagg 

tatcactcaa 

aagagaaagg 

cctgctgcat 

ttaaaaagaa 

gagggcagag 

caaaggcacc 

ttctacagtc 

catgtctgac 

agccggcttg 

cgtgctagag 

aatagccagg 

acactgtgcc 

gctacccagg 

accataggta 

aaaacaaggc 

ggcggattte: 

ctacacagag 

aaacccaaca 

ataattttta 

ggtgtagctc 

ctgtatccca 
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gggtgattta 

aagctcctcc 

actatacaca 

tttttttttt 

aataaaccta 

ttttgactgt 

agcaggtctt 

tggtcagatt 

ctgtttactc 

ttgaactcac 

aagtgctggg 

agtggcttcc 

gcgtttctct 

tgaagcacac 

atgcaaaaac 

ggcattccct 

cactcagtgc 

ccagcattat 

acatagtgtt 

tctcttgaaa 

taacaaaagg 

tccaagggag 

cccttatgag 

atgaattaca 

tgaccaagcg 

gccctgctac 

tggtcgaact 

ttccgcagat 

ggcaagtttt 

caatatgctt 

catacttact 

acaagtgctg 

agggcaaagt 

atttcctcag 

cttctcacaa 

ccggtcacac 

gtcctgggtt 

cacagtgctg 

ggtctggaac 

tagctcccac 

tgctacacgc 

tttggagacg 

atagagtgcc 

tcactctgta 
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atttgttaca 

tgcacatgtg 

cgtttgggga 

cgacactgac 

cacagcgtgc 

tgtgagtgag 

gttcccttgc 

tcagtaagtt 

tcagtctcta 

aagacggaag 

tgacaggcaa 

tcttctaaat 


gcaggtacat 

gagaaaaaaa 

aatcttaaaa 

aggcccccag 

acagccaact 

atatgaaaac 

ctatctgtgt 

aacagcgtgg 

tccgtggcct 

ggtaccaatc 

accttagcta 

gtgtataaga 

gtggtgtgca 

ctccttccac 

tgcttggcaa 

taatgtatat 

tgtggacact 

acaacagaca 

tatgaccttc 

ttttttactt 

ttcaggagac 

aaaaaacagc 

gacgcattac 

gcgcatccac 

cacagcgcgg 

ctgtccaagg 

atctgcaata 

aagacacccc 

aaatgacatt 

gaagtattca 

ggagccactc 

taggctatgc 

atgcaggtaa 

caatctgcta 

gtcggagcat 

taatgagcct 

gacatctttt 

gggcagccac 

gggttacaga 

cttccttatt 

cacaccccca 

tccacactac 

ttctcacagc 

gggcaggcct 

aaatagtccg 

aggcaactca 

gctcacgagc 

agaacccaaa 

gcggctcact 

tggctaccac 

gaccaggctg 

aaggcatgtg 

tatgtgtatc 

tgcaggtcag 

actcaggttc 

atgtaatgat 

atgtgcaggc 

ggagaggaac 

tggccaagca 

tttccccaag 

ctgccactcc 

gtgagccaat 

ttcttggact 

ggaaaaatgt 


ggttatggaa 

aaaaaacaaa 

aaccttaaag 

agcagcacat 

ataataagct 

agagggcaag 

ggaaggactt 

agggcagaaa 

ctacacgcat 

cccacgaact 

gtgtctttct 

aggcaaatga 

tgccaggcgc 

caccccaggg 

gggcttcact 

atttttgaga 

agctggcctg 

gcaaccatca 

tcctgacaac 

caagctttga 

accctgtagc 

gatggggatg 

ctcacaccag 

cacacccaaa 

acaaacacaa 

atggcctctg 

tttgcctaaa 

aaggacgagg 

gccgccacac 

cttctggtca 

aataccttta 

aaatgctctg 

tgctacgcca 

gaggtgtgag 

cagacagagc 

ccatggtaat 

agttttttga 

agctggccct 

catatgccgc 

tacaaagcag 

agggtctcac 

tcaccgataa 

ggtacaaccc 

ggtacataca 

cctgcaggcg 

aagctgctct 

gatatgcaga 

gaagtaacaa 

agctagcctt 

acctgggcac 

gcctcaaact 

ccatcagcgc 

agtgtgtatg 

aggacaactc 

caagatagca 

tcagtgtgca 

taggaaacaa 

tcaggctgtc 

ttatttataa 

acaagccaga 

tcaacctgtc 

aaaatcttac 

gaatgcgttt 

ctggttaagt 


acagctttat 

acattctttg 

cagacgccac 

tccctgaggg 

cataaaaaat 
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ctcgggacgg 

aagaaaggaa 

ccaagcgata 

agaaaacaca 

tatgtttggc 

gtaaaggtaa 

attcgtggag 

gtcgcagggg 
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cagggtcttg 

gaggtctgag 
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ctccactttc 

tctggagtga 

gggcaaaggt 

atacatacag 

gcaagcattc 

agccaacctt 

cagactcctg 
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gaagattatc 

acagcccccg 

ccccgtgttt 

tcataaagtg 

agcacactta 

gtcttacgcc 

tttagacctc 

tgtcccctcc 

agtgggttcc 

gacagcaaat 

gaactcctcc 

catgttcagt 

cagccaagcc 

cgatggcagt 

gatcatgcgg 

attctgctgg 

gttctctttg 

aaaggaagac 

tccttgcaag 

atcactaact 

accggcactc 

gaactcagag 

tttgttttcg 
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ctgacccaat 

acgtccatat 

tcaggagtca 
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cacagtggtt 

cctgtgggag 
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gatggtgatg 
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tgtggggcag 

tgacagtaac 
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gattttcaac 

tgctctttgg 

aagccttgtg 
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actatctgct 
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tcccagggat 

ccccaagggt 

ctatgtagtc 
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aacagctcag 

ctctttgagt 
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ctattatttc 

ttttcctact 

actctttggt 

cacttttagt 

ggctttgtgt 
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tctctctcct 
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tgcgtctcct 

gtaagtgacc 

ctgcagccca 
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tatttgaaat 

tcaatcactt 
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cgccatcagc 

ctggtttcag 

acgcacctca 

tttggtctgc 

agacagggtt 

cctgcctctg 

tctttttatt 

gtatatgact 

gttctctcct 

ctttaacagc 
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aatcatgtca 
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aggcatttgc 

catttaaaga 

agtggaagag 

cgaatacata 

agctggttgc 

ccaggggatc 

tgcaggatgc 

gtggatttga 

tggaacatgc 

gaaatagctt 

tcaaagccca 

tccgttgtct 

gatgtttaat 

ggatcgagca 

aataatggag 

tccaagagaa 

cagtggttaa 

gtggtggctc 

ggctacagtg 

aggaagagga 

gccagtctgt 

gctacattat 

caggcctgag 

aactcaagcc 

cctggcccag 

cggcccctcc 

caaacctagg 

tgtgctgtga 

cacctagtcc 

ccctcctcct 

aactgggctc 

cccctccccc 

aagggacatt 

gcccaatcag 

ctcagtgaga 

taaaattgac 

gaaggtagca 

ctacacgatt 

ccaacagagt 

aatccgtctc 

gacggctcat 

ccactcttag 

gaacttgtag 

tgtgatcaca 

ggggttgtgc 

gtctctggtc 

agacccacac 

taaatattgt 

ctgtttttct 

ggtactacag 

ctcaatcctt 

cttaaaaagc 

tgggaatgac 

cagtgacaac 

attctacccc 

cagcgcgcag 

taattgcttt 

ggctttcagg 

ctcaatagca 

ctctggttta 


gtggatacct 
agtacctaca 
gacccccacc 
agctgttatc 
cactatgcct 
gacgaagcta 
cttgcttagc 
agaaggctta 
ttaatttccc 
tgatgtcctc 
ctcgggccac 
gtttggtccc 
atgctctacc 
ttaggaggat 
atttaaaagt 
gtctttagtt 
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cagtggcgag 
cgcaggttag 
gagcactgac 
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tacttacata 
ccactgagca 
ggcatggcag 
gt ccctctac 
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cctgccgttc 
gttctcagat 
actcctgtgg 
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atgcaccagc 
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ccacccgcaa 
acattgacag 
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tttgtttcat 
ttatagaact 
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ttgagtgctg 

taagagagca 

tcctgtgata 

tgagacagtg 

cgatattttg 

accaaggtag 

tgctcttctg 

gtaataatat 

taataaataa 

cacattgaaa 

caggaattac 

actggccgtg 
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ttagcctcat 

tcagagatgg 
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gccgctcaag 

cagagccagg 

taaagatcgc 

gtattttgtt 

gtgaaatatg 
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gccttaagtg 

catcagactc 

aaattctacg 

tccttaagct 

ccctcaggat 

gtggaggttc 

caacaaagag 

atgtgaccag 
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cttgccgtac 

ctatgttcat 

agaagacata 
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agcacagata 
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ctagcctgaa 

ctgtgtagtc 
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tagacagctt 

tgttggtttt 
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aaggaagagt 

aaacttaaaa 
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gcaaagtttt 
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ttttcaggtg 

tgtatgccca 
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cctatattgt 
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ggtgcgaggt 

ccagttggtg 
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agccacgaag 
ggaccgagag 
gctgccgaga 
aagaagaatg 
acagacctcc 
cacacagcga 
ccctcaggac 
accttccctg 
ttcactgtga 
aagaacactt 
gcaagcacgc 
ctcaaccctg 
tgtgaaatca 
ctcattattg 
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cttataggcc 
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aggtcttcaa 
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tgggtaagtg 
catagtcatt 
agtaatacac 
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gtcttcttac 
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tcctttgatg 
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aattatgtat 
cacaccagaa 
tgggaattaa 
cccttttttt 
ctgtatgtag 
ctttggagct 
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ctggatgaga 

gccatcacgg 

catcactgac 

ggactctaag 

ccgatcatct 

taaagcccca 

tgtggtcacc 

gttgttcgac 

gtctgaaaca 

gggacgaaag 

actctttgcc 

caccgatccc 

ggacagaagt 

tgttagcgat 

gggcaagcta 

tggctctcac 

tcaataaggg 

tgcagaagca 

gagcaggcca 

ccctcacagg 

agctctggta 

aaacgaggga 

ccctagttta 

ccagcctttg 

catttttaga 

tgatgggctg 

acgcccgcaa 

aacaagtcat 

atggataaga 

tttctgatac 

ggaagaagag 

cgtgtgtggg 

gtgaccagga 

gttggtgctg 

agcagtctct 

gaagccctcc 

acccgccctg 

agagcgctgc 

gacgcgtccc 

aggtctccaa 

cgnnnnnnnn 

nnnnnnnnnn 

agatatagat 

gagggcatca 

actcaggacc 

tttttttttt 

ctatgttcac 

ggacttaaac 

accgccttgc 

ctcagtcctc 

attctgtgtg 

tgattcattg 

ggtctccctg 

catagagccc 

tcagagtctt 

aagacctgag 

caatgatgct 

cacagatgct 

cggaagatag 

ggttggttgg 

aaaaatgtag 

ctgaattttg 

ggccctttgc 

ttttagctta 


136740 

136800 

136860 

136920 

136980 

137040 

137100 

137160 

137220 

137280 

137340 

137400 

137460 

137520 

137580 

137640 

137700 

137760 

137820 

137880 

137940 

138000 

138060 

138120 

138180 

138240 

138300 

138360 

138420 

138480 

138540 

138600 

138660 

138720 

138780 

138840 

138900 

138960 

139020 

139080 

139140 

139200 

139260 

139320 

139380 

139440 

139500 

139560 

139620 

139680 

139740 

139800 

139860 

139920 

139980 

140040 

140100 

140160 

140220 

140280 

140340 

140400 

140460 

140520 
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tgctcctatg 
aatgcctctg 
caggtgaatc 
gatatgtagt 
aaaaagaaag 
agaaagaaga 
tgttaacata 
tttaaattat 
tgctggccac 
cagatgtgag 
ctcttcgcct 
gtcgttcttc 
gaactgcatg 
tagagtcaca 
cagtgttctg 
ctgagtcatc 
caccacgctg 
ctgagtcatc 
ctgtgttatt 
cagagat ctg 
attaaaagaa 
gggaagaagg 
acgtgagctc 
gctccacgct 
ggttcagagt 
actttccgaa 
tgtaatggtg 
cccatggaaa 
ctcaggctca 
ctgcagggaa 
catacacata 
catcagtcta 
atggagaggg 
tgcaatttta 
gtaaaagtaa 
tcataaccag 
tgtgagaatg 
ccactctttc 
caagtaattt 
agatctaata 
aatattttta 
gggtcaagcc 
tgaaagaatg 
tttgttagta 
aagagttgca 
cagccccata 
aacagcctag 
tgactttggc 
atccttgggt 
caagtgggcc 
tgagttcagt 
tctgacctca 
cactcgggag 
atttacagtt 
tcagtggtag 
agagagagag 
aaggaaggaa 
gaaagaaaga 
gaaagacaaa 
gagggcatag 
cctagcaccc 
actactgtgt 
agtaatctgt 
ctagcctaga 


ttagtaaata 

gctgagtttg 

tctgagttca 

aagactctgt 

aaagaaagaa 

aaatgagtgg 

ctactattaa 

tattttatat 

cacacataca 

acacgcctct 

ctacaatcaa 

ctggccagtt 

tgcacttggt 

ggttatgtat 

gatttaactg 

tctccagccc 

gatttaactg 

tctccagccc 

ctaactcttg 

tatgggctta 

tagagtgagc 

tcaggagttg 

tatgagaccc 

accacttgcc 

attgtgcaca 

aaatattaaa 

taagaaagta 

gttgcttaat 

cgagtgctta 

tccaaaaccc 

attttaaaaa 

ggctacacaa 

agatgcactt 

tagagagctg 

taaatgttac 

tgaagtaccg 

tgtttttctt 

ccttcacgat 

tccatagcta 

ttggcataat 

tgtacagcca 

tctgagtgac 

tgggaatgct 

ttatacattt 

aaaatatatg 

atctaccttg 

tatggtggcg 

tccgaaagcc 

acgtcactca 

aatgagatgg 

cttcagattc 

gtacacatgt 

gcagaggcag 

atataaagaa 

agcgcttgcc 

agagagggag 

ggaaggaagg 

aagagagaaa 

gcaaagcaaa 

atcagtgtta 

taccgcaaaa 

actttcacta 

tcttggtttt 

acacacagag 


gctaggtttt 

gtggcacttg 

aggcagcctg 

ctcaataaag 

agaaagaaag 

atttgccata 

caacaacaaa 

gtatggtgtg 

taggcagtga 

cctaggccta 

gctctcccaa 

gagcgcgcac 

acccagggag 

gggttctagg 

ctgagtcatc 

caccaagctg 

ctgagtcatc 

caccacgctg 

cttccttgga 

aaacaagaat 

ctaggagttt 

aaggctagtc 

tgtctaggag 

taacacgcat 

ccctgtatct 

gctacatcat 

tttggcttca 

gaaaacaggt 

ctgcccttgc 

tcttttggcc 

taaaataaat 

gaccttgtct 

cctattccat 

ctattttctt 

tttggaaaat 

ttaacatttc 

ttacaaaatt 

ttattgtacc 

actctgtgtt 

atttggcatt 

gtgttttgtt 

gctgaattgt 

catttcttaa 

atttaggggg 

gtgtggtttc 

ttaaatatac 

ttaacctcgg 

ttcctggaat 

tgtgcacaca 

ctccatgtat 

tgcaagataa 

catggtacgt 

gaggatctct 

actctgtttt 

tagcaagcgc 

agggaaaggg 

aaggaaggaa 

gaaagaaagg 

actaaataaa 

taatgcttac 

tatttgctct 

gcaacataat 

atttttcttt 

aactctcttt 


tagctagcca 

cctttagttc 

gtctacataa 

taaacagcca 

aaagaaagaa 

gcctagtcaa 

tactttacac 

ggaagctgtt 

aggttctttt 

tgtaagcagc 

taaacgtgtg 

aagagtatgg 

cccagaagag 

aatcaaaccc 

tctccagccc 

gatttaactg 

tctccagccc 

gatttggaag 

accctgagaa 

atgtccaact 

ggcacactcc 

agtcttagct 

aggagaaaga 

gacgcccaag 

ttaaaagtca 

ccactctaca 

gttttatggc 

aaaaataaga 

agagaccatg 

tctgcaggca 

cttttaaaat 

caagaagaaa 

gaagctgcta 

tcttttaaag 

atagagaagt 

tgcttctatc 

gggataatgc 

cattttctca 

ctgttacaga 

atgatgctat 

tgggatgaat 

tctcaggaaa 

gcaacactgg 

ataggaaata 

ttttcctgtc 

tgtatataag 

gttacttgag 

tccagctcca 

tacacatttg 

aaaggcagtt 

caggagagga 

gtgtagtatg 

gagttttagg 

gaaaaacaaa 

aaggccctgg 

agagggagag 

gaaagaaaga 

gaaagaaaga 

atacatacaa 

ctagcatgcg 

gtcttgctaa 

gagaatgttc 

gccaaattga 

atgaactcaa 


ccttgctagc 

cagcactcag 

caagttccag 

gaatgactta 

agaaagaaag 

actagatttt 

taaccatgac 

gcagagtggc 

gccaagacaa 

accagttctg 

cagaaggatc 

gttttcacct 

ggcatcatat 

atgtcctctg 

caccatgatg 

ctgagtcatc 

caccacgctg 

cagaactgag 

aattccttct 

cttgatctct 

tatgttccaa 

ttgtgacaag 

aaggggtaaa 

gtttccgttc 

atagtcaacc 

aacttgtaaa 

ctttgcatcc 

caggagggac 

gcaggcagtg 

accacattaa 

gagctctaga 

gaaatgaagg 

ttttggtgat 

taatgcttgt 

ataaagagta 

cggccagcca 

tgcacttact 

cagtattaaa 

gaagaaatgt 

aataagcatc 

tagcacaaga 

ggactagtta 

ttattattac 

tgctcatata 

ccctagttta 

gcatatgtag 

aactgctctt 

agggatgcag 

gtttttaatc 

atgcaaaggt 

ccaacccctg 

cacatgcaca 

ccagcctggt 

acaggggttg 

gttcagtcct 

ggaggagagg 

aagaaagaaa 

aaggggaagg 

tatattaaat 

taaaactctt 

attatattgc 

actatcccac 

tgggtgaaca 

gtttcttatc 


ttacaactag 

gaggcagaag 

cctggccagg 

aatattgcta 

aaagaaagaa 

cttggttata 

aagctatatt 

agttggctac 

gttaaccaat 

ggctcagggt 

ctgttgcagc 

acatttatgt 

cccctggaac 

gaagaacagc 

gatttaactg 

tctccagccc 

gatttaactg 

gctttgaaca 

attggctttt 

gattttatat 

gcttcaggaa 

tttgaggcta 

gtgaggtgtg 

ccagccctgg 

cttggaatta 

agcccttctt 

taaatgttac 

tgcaaagatg 

ggtctcccta 

cacatgcaca 

acgagtttga 

tttgctgtgg 

tatggtactt 

tgttttgact 

aaaaaaaaag 

gagtttttcc 

gttttgtagc 

ttttcagctc 

acttaattta 

tttctgtata 

gtaaagtgtg 

acatccattc 

atactattat 

gttcaatttc 

gtctcacttc 

gtgaataaaa 

gcagagaaca 

tgcctctggc 

ttaggaactc 

ctggatgaca 

cgagttgtcc 

gaagtcccag 

ctacaaaacg 

gggatttagc 

caactctgga 

gaaaggaagg 

gggaaagaaa 

aagaaagaaa 

tttaagactt 

ggcttctaaa 

tagttgtcag 

tcctctgtca 

agtatttcag 

cttttatgat 


140580 

140640 

140700 

140760 

140820 

140880 

140940 

141000 

141060 

141120 

141180 

141240 

141300 

141360 

141420 

141480 

141540 

141600 

141660 

141720 

141780 

141840 

141900 

141960 

142020 

142080 

142140 

142200 

142260 

142320 

142380 

142440 

142500 

142560 

142620 

142680 

142740 

142800 

142860 

142920 

142980 

143040 

143100 

143160 

143220 

143280 

143340 

143400 

143460 

143520 

143580 

143640 

143700 

143760 

143820 

143880 

143940 

144000 

144060 

144120 

144180 

144240 

144300 

144360 
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ctccagaggt 

tttggttggt 

ttttgttttt 

agaaactggc 

cacgtgccac 

ccagcattac 

ttaagagttt 

ccagaggtcc 

gatctgatgc 

taaataattc 

ggatcataaa 

cactgctgta 

cagaaggttc 

ttatctgaga 

tctcattcaa 

ctcttccatc 

gcagtggaaa 

acactgacat 

cacaaacaat 

ttccaatagg 

gattccaaaa 

caaagacttg 

acatagcagc 

cctctacagg 

aagatattaa 

gtcatcatgg 

agctgactgt 

gcattatatc 

tagtgcacgc 

gagtttacag 

taaacaaatc 

aggtccttaa 

ctggagttaa 

agaagaacaa 

actgaggcat 

gagctgctac 

tttttaaaaa 

aaaaactgct 

taaattttgt 

tcattttaaa 

tcatttttcc 

ggtttgatta 

gactgtagat 

ccctatattg 

gagttccttt 

gtcatctctc 

tcacacctag 

aatctcacta 

actcacagag 

aagcccagca 

ctattgagcc 

gacaggccta 

agggttaaag 

taataagaca 

cactcaggag 

aatgggaggg 

tgtaatttaa 

tcatggtttt 

tccttttgtt 

gaaagactgc 

ctccaggagg 

ctggtgcttt 

caagaaaagg 

atagccctgg 


ttgtttttgt 

tggttggttt 

aatttaatgg 

tttgaacttc 

cacatctaac 

tttgactctt 

gatccagggc 

tgagttcaat 

cctcttctgg 

tttaaaaaaa 

ttcaaggtga 

ttcatttcct 

agtattcttt 

tgcagagtcc 

ggcctaacac 

tgggtcagta 

atgagctcct 

ccacccagac 

atgtagttga 

ccagcacttc 

tgtgtccaga 

aaagatggct 

tcacaatagg 

cagaaagcac 

ataatagccc 

gcagtgtaaa 

gagtggcttt 

cgtggatttt 

ctttaatccc 

agagtgtcta 

agatctgtat 

ggacaggaaa 

ggactgttgg 

gaatctatct 

gctatgtgtt 

taaaacctaa 

aagagtgggc 

gctgctgctg 

tgtatacttt 

ttggcactgt 

atttatacag 

cagaaactat 

ccaaaggaca 

atcatctctt 

catcacttta 

tttcagatga 

cctcttactc 

tgtagttctg 

atctgatagc 

aaatttacct 

agtcatggtg 

tgggttcggg 

ttttgagaca 

aatcagacat 

gtgagatcaa 

gaggaggaag 

ggcctaagct 

ctgtcttcgg 

ttcaggcatg 

agcccaagat 

caagacagac 

cctcttacca 

agactctccc 

gggaagagaa 


gggtttatta 
tgctttactt 
actggttcca 
tggtcttcct 
aatatctggg 
catagtttct 
tggagagatg 
tcccagcaa c 
tgtgtctgaa 
aagagtttga 
gcctgggtga 
cattgatctt 
ttttcccaat 
cctttattcc 
tgaggacatt 
tgaagtgagt 
ctctgttctt 
ccattctctc 
caagaagtag 
tgtgtccata 
tattacctcc 
tgatggctct 
ctataacgtc 
agacatacat 
acgcttgagc 
agtaaaggtg 
cttctcatcc 
aagttaggaa 
agcactcagg 
ggcagccagg 
ttcaggaaga 
gtattccaca 
aatagttgaa 
gggcatggtg 
tgtggctaac 
aaaacaaaaa 
agaatgtctc 
ctgctgcata 
agctgtgctg 
acagaaatta 
aagcaatgta 
caaagtattc 
agtggagaaa 
ttacttctaa 
gatatataat 
gaaacaaaga 
actccatgaa 
gttgtcctag 
ctctgcctgc 
tcttaatttt 
agacaggctt 
gcagcctgat 
aaactttgtg 
aatattataa 
tccccagcat 
gaagagagaa 
ctcgactgaa 
tacatagtga 
tcttaaagac 
gatggtgcgt 
catcagtgat 
gattctgcaa 
agaacccatc 
cacacagatt 


gtgttttttg 

tttcttattc 

tgtggccaag 

tcatctacct 

tttctttatt 

tccagagtta 

gctcagtggc 

cacatggtgg 

gacagctaca 

tccatttaca 

actggcagaa 

cagaggtttg 

tctattcagt 

tgtgaccatc 

tcactgtgct 

attgaagaac 

ctgcacacat 

ttctgactca 

ctatgtcatt 

tcagctagat 

ttcgtttgtt 

tccagaggct 

atttccaagg 

gcaggcaaaa 

ttattcctct 

ccccgccctg 

cctaactgct 

atgacaaaga 

aagcagaagt 

gctatataga 

tgcctatgag 

tgtgtgcatt 

tgttcacact 

atgcacaact 

ctgggctaca 

tctaataatc 

tgattttgtt 

gcacacagct 

tcaactttgg 

acagccatat 

ctgtattaaa 

tctaaatgat 

aacttaggaa 

taatagtggc 

ttatctcatc 

cattgaaaaa 

tattcctttt 

aactcaatat 

cgagtgctag 

ctaagacgtt 

aatctttaat 

ctatagagag 

tcaaaaacaa 

acaagtatta 

aatgagggga 

aaaggaaaga 

gttgtccact 

agagtggaaa 

atagtggaag 

aagacagaat 

ctcgaaagca 

aggaatgtcc 

cctgcagctc 

tctggggcct 


tttggttggt 

atttttttat 

gatagcctca 

cccaagtgat 

ggagtttgaa 

ttacactttc 

taagagcacc 

ctcacaatcg 

gtgtaatcat 

ctggacttct 

gtggcagaag 

ctaacgggaa 

ctttagtagt 

tcgttgtttt 

atgccaatcg 

ctggtgtcct 

tgaaatcaac 

tgtcacctct 

gtccacagtg 

gtgctgctga 

tcttctaaat 

gggtttgatt 

ggtctgactt 

cacataaaca 

gatgatacag 

cccaggggca 

gcaacatcat 

ttagatctat 

aggtggatct 

gaaattctat 

ctgtttgaca 

ttacagaaat 

cagtgttctt 

gtattcctaa 

tagtcaatat 

tgggatttta 

cagatggcca 

aatatttgac 

aaaaaaagta 

tagtctagac 

tattcagtct 

gaaaaaagat 

gaattttccc 

catttattga 

ctaaaatgac 

tctaacttgc 

ttttttttcc 

atagaccagg 

ggttaaatgg 

tctcctctta 

ctcagcactt 

agttctatgg 

acaaacaaag 

gtgtcactca 

ggaggggaga 

tctcaaagca 

tttaatgacc 

ccaagggtca 

caggtatgca 

gcctggtgaa 

gcctcactgc 

agcatctgaa 

tcaccaatgc 

ccctctctgt 


tgtttgtttg 

ttctttattt 

actttgtagc 

gggattaagg 

agggattcct 

atttgttaca 

aactgctctt 

tctgtaatgg 

ataaataaaa 

tgaggcagca 

ctgtgtctca 

gtaagtggaa 

agatccctca 

tcagggagtg 

ctctgcagcg 

gcctgtggct 

tagcttgcaa 

cataggtgac 

catggatttg 

tagtatttta 

aagccaggca 

cccagcccca 

cctgttctgg 

taattgaaag 

ctctcctgaa 

tcagtgaggt 

caactgtgga 

ggccaggcac 

ctgtcagttt 

ctggaaagaa 

tgtgtgatag 

tggttataca 

caaatcaaat 

catgtagaag 

tggacagtca 

tattttcctt 

cagaacctag 

tatacgtata 

tcccagttta 

acattaaact 

tatctacagg 

tcaagaatct 

tttatcccct 

acatacccag 

ctgttgatga 

ctgcataaga 

tttgagacag 

ctagcctcaa 

atgtgtcacc 

aaaaatggaa 

aggaggcaaa 

gttaaagttt 

ccagactgct 

attaaaaagt 

gaaggaaatg 

gaacacagga 

cttttcatgc 

tcctggaatt 

tgggtatcct 

cctggggaga 

caagccaacc 

aataaagatc 

cttcgaggat 

cagcttatgc 


144420 

144480 

144540 

144600 

144660 

144720 

144780 

144840 

144900 

144960 

145020 

145080 

145140 

145200 

145260 

145320 

145380 

145440 

145500 

145560 

145620 

145680 

145740 

145800 

145860 

145920 

145980 

146040 

146100 

146160 

146220 

146280 

146340 

146400 

146460 

146520 

146580 

146640 

146700 

146760 

146820 

146880 

146940 

147000 

147060 

147120 

147180 

147240 

147300 

147360 

147420 

147480 

147540 

147600 

147660 

147720 

147780 

147840 

147900 

147960 

148020 

148080 

148140 

148200 
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acacaccctt 

ctggtgaagg 

cattgccttg 

tgtctgaagc 

agccaggaat 

gggctgctcc 

agatttgaag 

acctgtttac 

agtgaagaga 

tatggaagta 

gaacacagat 

agcgtgcagg 

gcagatttga 

gcccaggcaa 

attggtattt 

aataaatcag 

agctttgagc 

tctaagacag 

tctcttaaaa 

caagaggcaa 

caggctacat 

caatagcaac 

gcttttttca 

catgtatgtg 

gccactgagc 

atttattcat 

ctaccatgtg 

tcctgaacca 

tattatatgt 

tagagatgat 

gcagtcagtg 

tggtttccat 

gaacttggtg 

aatgatggtg 

catactacac 

tggagagtga 

aagaaaagtc 

tttctccccc 

gccggaggat 

aggacgacat 

atgttttgcc 

gggcatcaga 

ggaatcaaac 

tttattacat 

cccccccccc 

tctagttaat 

cagattaata 

tacactgtcc 

tgccaattta 

agcagtagga 

agttagtctc 

cctctcaggg 

gtgagtggaa 

atgggtgaga 

ctctgacaat 

ttattggcag 

ggacagaggt 

cccaggaatc 

tttcaccaag 

gagaaggaga 

tcaagacact 

tctgccttca 

tggctcggct 

ggtttggttt 


tgaaaggccg 

aagatgcttt 

agaacaagtg 

acactttggc 

gtatgcagca 

tcacactcgg 

atgttgatag 

ctgaaagcaa 

agagcaagat 

attataatgc 

tcatctgtgc 

cccaaacctt 

ctgtttgggg 

taacatccat 

caagagctaa 

tgacctaaaa 

acatttctgt 

ctttagctat 

acttgagcca 

aggcaggcag 

agtaagatcc 

aaattaattt 

gacacaccag 

gttgctggga 

catctctcca 

tttgtgtgtg 

tgtcttggag 

tctccctagc 

aagtacactg 

tgtgagtcac 

ctcttaactg 

tttatccaaa 

tgtgagcatg 

gcctgcctga 

actagctgag 

tcatctccac 

cttttaaggt 

tccctcattg 

ctttgtgaac 

agatataccc 

tgcatgtatg 

tctctcagaa 

caaggtcttc 

attttcctca 

ccccgagcag 

taagctgaaa 

ctctgttagc 

agattttgcc 

tacagtgttt 

tttgaaagct 

tgtggtacac 

tgtccagaag 

ttctgcagca 

gggccttctc 

cttagcctgt 

taaatcagag 

cattggctga 

tcaggtgctt 

gttatgtatg 

agccctgctg 

gatagacttt 

ttggctcttt 

agtaaaggtg 

ttgttttttt 


ccatctagtt 

tgtaagtgtc 

ggataccaga 

cacagtacct 

ggcatgggac 

ggtcttctgc 

agttgtaagg 

gaatctggtc 

aggtgaaccc 

tatcttcagc 

catcagtgtc 

aaagtcccct 

agcagccatg 

atgtcacaca 

tgtttaagga 

ggaaaacaca 

gaacccagag 

acagcacgtt 

gagccaggtg 

atccctgaat 

tgtttcaaat 

tttagatgta 

aagagggcat 

attgaactca 

gtccattaat 

gtatacatac 

atcaaactca 

catttctagt 

tagctgcctt 

catgtggttg 

ctgagctatc 

gatgccttta 

aagaccagag 

gattccagga 

ctgtgtgctc 

aggcaagcac 

ggtggtggtg 

tgatggcaca 

tggaggtcag 

tgtctcaaac 

tctatgcaca 

ctggaatttc 

tggaagagca 

attacatttc 

caagtattct 

gggagggagg 

ctaactacac 

atcctttttg 

ataggtattc 

gacatttatg 

attttggggt 

ctgctgcaca 

tcccggtcca 

caagggcctt 

gcatagtttt 

gttatccact 

aggataacgt 

gctgactggg 

gcagagggta 

ataagggaag 

gtccttaatt 

gagccactgc 

tttgcctcta 

gagacaaggt 


gccacaaaag 

cttatcccag 

gttaccagtg 

tgtggcagct 

ctctaccata 

catgttgccc 

atgctttgtg 

aaggcatttg 

ccaattcctg 

cttttgcccc 

tccgcactgt 

aagctgaaag 

ctgcgacaca 

ggtaagtcag 

gaaaaacact 

gccgtcttat 

cttgggaggt 

tgaggtcagc 

gtggtggcac 

ccagcctcgt 

aaataaatat 

tttatttatt 

tggatcccat 

acacctctgg 

taaaaattta 

tataatacag 

ggttcttagg 

attcttttct 

caaataccgg 

ctgggatttg 

tccagcccca 

agggcctggg 

ttcagatccc 

caacggagac 

aagagagcag 

acacctgagc 

ttgttgtttg 

ttcctttaat 

cctgttctac 

agacaaaaat 

ttatgtctgg 

agacacttat 

gcaagtagtc 

caatgctatc 

tcattgctgg 

tagatgttgc 

tatagaagct 

ngtgtatatg 

tttgtgacgt 

tggcctatgg 

ctatctgcgg 

ctgggctgga 

gctgggagtg 

cttagtgtta 

ttattcgaga 

cttagggaag 

actgagatgc 

tttcttaggg 

taggggtttc 

tcccccattt 

agcagggccc 

tacaaaatat 

agcctgaagg 

ttctcagcat 


acattctccc 

gagaaatgcc 

gagacctcta 

gcagctatgc 

gcacagagtg 

tgagaatgac 

acgaggttgg 

attacacaag 

gatgcaatgt 

atactgaaaa 

gcagacaact 

cagtgacccc 

tgctacagct 

aaatggtttt 

ataaaggaag 

atatcattat 

ggagatagga 

ctgaactaca 

atgcatttaa 

ctatatagtg 

aacaaaaaca 

ttatgtatga 

tacagatggt 

aagagcagtc 

aaactagagt 

gttcataagt 

catgggagca 

tttgtcttga 

aaaggggaat 

aactcaggcc 

tttttagtat 

aagatggctc 

tagcacccag 

agacaggggc 

ccctggctta 

acacagacat 

ggggtctttt 

ctcacatctg 

atagcaagcc 

tatttatttt 

tgcccatgaa 

caagtactgc 

tttttttttt 

ccaaaagtcc 

gccatctccc 

ccaaacttag 

tattctttag 

tctacagatc 

ggatctttta 

tcctgttaaa 

ttccgcatct 

aggatgaagt 

aatgctgggg 

cagctctagg 

tgagcttgta 

gagataggaa 

tcattagcac 

ggttgagagg 

aggctcccca 

tgagccactt 

aacaagtgtc 

ccaaatctgg 

cctgagtttt 

agccctgggt 


aggagaactg 
acgacctcat 
ctgtcaccga 
caagtattgc 
ttctcttggg 
tcttttagcc 
tagcacagac 
tcagggagag 
caatggaaag 
gcatagccca 
caaagctgac 
agggctgtgt 
gcagtgtaat 
tacttacatt 
cctggcatca 
gctattgaga 
tgattaggag 
tgagaacttg 
ttctagtact 
agatccccac 
gcaataataa 
gtacaccatt 
tgtgagccac 
ggtgctctta 
atttttaaac 
caatt ctctt 
agtatttact 
aagatttatt 
caggtcttgt 
ctccagaaga 
tcttattaag 
agtgggcatt 
gcagatgctg 
cctagctaac 
ctgtgcagga 
gcacaaagga 
gttttgtttt 
ggacaaagag 
catttcagcc 
atatatttga 
agccagaaga 
ctgagtgcta 
ttaatatttt 
cccataccct 
catctccttt 
gatttattga 
actttcacat 
ttaattcagc 
cccatcttaa 
tcacatttca 
cacacttttc 
ggagtccaga 
tcaggaggag 
caaaggcctt 
tgcatacact 
tacccaaggt 
ggggaggcat 
gtagcagtga 
gacaaagaag 
cagaaggcta 
tgttttcttt 
gccaggaagc 
cacttgattt 
attctggaac 


148260 

148320 

148380 

148440 

148500 

148560 

148620 

148680 

148740 

148800 

148860 

148920 

148980 

149040 

149100 

149160 

149220 

149280 

149340 

149400 

149460 

149520 

149580 

149640 

149700 

149760 

149820 

149880 

149940 

150000 

150060 

150120 

150180 

150240 

150300 

150360 

150420 

150480 

150540 

150600 

150660 

150720 

150780 

150840 

150900 

150960 

151020 

151080 

151140 

151200 

151260 

151320 

151380 

151440 

151500 

151560 

151620 

151680 

151740 

151800 

151860 

151920 

151980 

152040 
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tcactctgta 

gcttagatta 

gatggctcag 

gcaaccacat 

ctgaagacag 

aagaacacta 

ttcattctag 

gaaaggccac 

ttctacctgt 

gatggttgtt 

gccaccacat 

agttacccat 

gcacatattt 

agcccagttt 

aaaaacaaac 

ttgggcgtgt 

ttgggaggc:a 

gaaaccctgt 

gttaatatga 

ggagacattc 

cnnnnnnnnn 

nnnnnnnrmn 

aaccgtcagg 

attaaagcac 

gagaggcaca 

ctgggagaag 

agcagctatt 

tctgcagggg 

ttgtgtctat 

gtgtgtgtgt 

gtcagtgctc 

ggccaggctc 

taggaaataa 

aattggattc 

tatgcagagc 

gactgggctt 

tgcccgtcct 

gctactgaaa 

cagggccctc 

ctcctgaggc 

ccatccaacc 

agaaacctct 

gctaatgtct 

tgtgttgcat 

tgattcttgc 

gcccagggac 

tttttttctt 

tagtttataa 

tttatatatt 

taagtggaca 

aaatgtcatc 

ccgggggctg 

gtggcatagg 

tctgagattg 

tccaaaacca 

agcttcttca 

cactgtgctg 

atacccttcc 

c ggtggctga 

aaaaaaagaa 

agtgctgggt 

ggccaggagt 

acggatgggg 

gtcaaagctg 


gatcaggctc 
aagttgtgcg 
tgggtaagag 
ggtggctcac 
ctacgggtgt 
ttcacaaggt 
gggaaaaatc 
agcacgtgta 
ttttgaggca 
ctatttctgc 
aagcatggca 
caagccatct 
aatcccagca 
acaaatcaag 
atccttctgt 
caaaaatcac 
gaggcaggag 
ctcaaaaaaa 
aatagcagaa 
catactgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctcagaggca 
caaatcttgt 
catagcttcc 
cttgaagctc 
cagcagctgc 
aaagacaagg 
gtgggtgagg 
gtgtgacaga 
cctttggctc 
gcccacccca 
aaaacagggt 
agcatttctt 
acaagagact 
gcctgcaggc 
ggggtcatcc 
caccctcat c 
agtaggt cac 
cagagctcag 
cattggacta 
tccccagctc 
gcagcttcta 
ataaaaacga 
tgcttctctt 
tattgttgct 
gctagaagtt 
acatgataag 
tatatatata 
taggtataaa 
cactggcgat 
cctgagacgt 
gttggacagg 
ctggttgccc 
gcagggtcct 
aggggcgcrat 
gagtggatac 
cgtctacggc 
aggcttcagc 
agaaagaaag 
gacggagccc 
ggggccccaa 
gtcctgcgag 
aacttgtgct 


aacttgaatt 

ccaacactgc 

cacccgactg 

aaccacctgt 

acttacatat 

acaacaacag 

tttttaaact 

tccaggttag 

gtctcttgtt 

ctctcatgtt 

ccagcactga 

tgctagcccc 

ctcaggaggc 

tcccagaata 

gttccagtca 

aagtagcaag 

gatctctgtg 

ccaagcaaaa 

ctcaagatag 

tatatgcaga 

nnnnnnnnnn 

nnnnnnnnnn 

ggatggcagt 

actaaaaaca 

gggcccttgc 

aagaactggc 

ggctcttggg 

agatggcact 

gaagtgccgt 

gacagaaagg 

ctcctgccaa 

ccaaagccca 

tcttgtccaa 

actcctcagt 

gtgcttgcca 

tgccttctgg 

aggaggagca 

tgctaaggcc 

tggccaaaga 

agcctccctg 

acactgtaga 

cgcctagctt 

cagtaggggt 

ggtgatgttc 

ccttcccgac 

ggctggggtt 

accgatacag 

acatacagtt 

tatatattat 

actgcacctt 

ctctcctttg 

caggaatgag 

gcaggttaac 

cagctggctc 

tcagcccttc 

gtcccgacac 

ggtccagagg 

tgtacatttt 

agccttttcc 

aaaagaagaa 

tgggtcctca 

gggccgtgcc 

cacacttgcc 

gatagtcggg 


cagagatctg 

ccacctaaaa 

ctcttccgaa 

gatgagatct 

aataataaat 

tgtactagga 

ttagtgtgta 

gagtcaactt 

tctgccctac 

gccctatgca 

gccaagggca 

agaaaatgta 

agaggcaggc 

gccaaggcta 

caatgactgc 

gtgtaatggg 

agtttagcac 

atagaattac 

ctaatgaaac 

gcctgatgct 

nnnnnnnnnn 

nnnnnnnnnn 

gatgtgtttt 

gccctctgct 

cctctgctct 

cgctgttgcc 

caagcttgga 

gagctgggta" 

gtgtgtgtgt 

gagagtgcat 

aaagcatgct 

ggatctgcca 

gagaattttt 

atcctctctg 

gaaacccttg 

gctgacccct 

ggaccactgt 

actagatttt 

gcctagatgt 

gcctgcacaa 

cgctgccttg 

gcctccagca 

gacggggtgc 

taagtatcta 

tcttcccact 

ctgaacaaaa 

tccttaattg 

tggtaaggag 

gtgggtgtgt 

ctgtgtgact 

gtgattggtt 

aggcattacc 

taagtgatct 

tgggagagcc 

tctccaaatg 

cactaatgac 

cgctcggtca 

gggcttataa 

tgaaacccag 

aaaaattctg 

gcggagagtg 

tgaggatgcc 

cttcctcttg 

ggcatagtcc 


cctgcttcta 

aaaatatgag 

ggtccgaagt 

gatgccctct 

aaatcttaaa 

aactttaaaa 

agaaagagag 

ttcagaagcg 

actttgtaca 

tgctgagctt 

ggcatctggc 

tttttgacag 

agatctctgt 

catagagaaa 

tgtaacaata 

caggccttta 

agccagggct 

aagttaacca 

aaaggattct 

tgcctagcaa 

nnnnnnnnnn 

ngcctgggca 

ggggcacaag 

ctcgatctgc 

cccggtcagt 

tttgtgctga 

agtagctcgt 

gccaggatgt 

'gtgtgtgtgt 

gttcttgtgt 

gtgttctcag 

ccccattaag 

attattattt 

gtagggaatt 

ggccaacagc 

gagtctggcc 

gatacagggt 

tttcccactg 

taaaactgca 

agtgctaaga 

tctgccagct 

ccctgacatc 

tccggggtgc 

agaatgttgg 

gacacctttg 

ttgccgcagt 

agcaaatata 

tgggttgggg 

gggcagagtg 

ctcattgcga 

cttggacccc 

cccatggata 

cagacaagag 

ttgttctgag 

accaggcttc 

tcactttgcg 

ggacagccga 

tccaccagga 

cagatcttcc 

ttccttgctg 

actgccagcc 

tgctgcaccc 

ggtcgtgcgg 

tgcagttcgg 


catcccgagt 

gggctggtga 

tcaaatccca 

tctggtgcat 

aaaaaaaaaa 

tagcccatca 

aggggctgta 

gagtctcccc 

tgaacttcaa 

actgatgcca 

tcacatggag 

gtgtggtggt 

gtctgaagcc 

ccctgttttg 

atatgaggat 

gtctcagcac 

gttacacaga 

gggtattggt 

attttataaa 

ctgaactaca 

nnnnnnnnnn 

agagtaagca 

gccccctttc 

agagatacga 

tcctggactc 

cgccagaacc 

tccttctcct 

gggaagtaaa 

gtgtgtgtgt 

gttcctgtga 

agtcaagctt 

aatgcggggt 

tttctctctt 

caggtctgtc 

cctattccct 

ctctgacctc 

tcctgagatg 

ccagactact 

gctaaagcct 

aggacattgt 

tgaaaacccc 

cacttctctc 

cagacaggcg 

tcccctgaag 

cccccagcaa 

tttttgtttt 

ggttcccgta 

gacctgtgca 

aggatatata 

gtacagttct 

agcaggtctg 

gggactgagg 

ccaagagtgc 

tcctgctcct 

cgcagagccc 

tgcctttgac 

gtgagacgtg 

agggtccaga 

acttaggaaa 

gacatgtggc 

ccagtatcca 

cactgggggc 

tgggcatgat 

taagctcttt 


152100 

152160 

152220 

152280 

152340 

152400 

152460 

152520 

152580 

152640 

152700 

152760 

152820 

152880 

152940 

153000 

153060 

153120 

153180 

153240 

153300 

153360 

153420 

153480 

153540 

153600 

153660 

153720 

153780 

153840 

153900 

153960 

154020 

154080 

154140 

154200 

154260 

154320 

154380 

154440 

154500 

154560 

154620 

154680 

154740 

154800 

154860 

154920 

154980 

155040 

155100 

155160 

155220 

155280 

155340 

155400 

155460 

155520 

155580 

155640 

155700 

155760 

155820 

155880 
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cccagagctc 

cttcttgtaa 

ggccccatgg 

agacttgatc 

cagcttcaga 

gccacggaac 

attcccattg 

ctcagtgagg 

cttgtggtga 

gccccggaag 

gagattgacc 

gtagagatac 

acacttatag 

cgttcgcagc 

ggtgttggga 

gctgaagtgg 

gtcctctggg 

gggcgcaggg 

gagcagcaac 

ggggctgccc 

aatgactctg 

cagggcattt 

tctctaattc 

ttgtgaacac 

tagctgcctt 

tatttaaata 

aataagcgcc 

gtttcctcat 

tgaggggctg 

tgccagatcg 

attgccaggc 

tgtagcttcc 

attttcggat 

ctcattatgg 

ccagcaggcg 

ggccaggcag 

gttcagctgg 

ttcttgctgc 

cccacttggg 

gacttctgac 

gctgttggga 

ggcttcagct 

aattgggttc 

taggacccct 

cggcccttag 

tgtctagatg 

ccagcctctg 

ctgccctgtg 

tccctgcttg 

cggtcccctg 

tttatgctgg 

cttttcttcc 

accattcgac 

ttcccctgtg 

ggaagtccct 

gctctattta 

agtttataaa 

actggggttt 

tgcagaacca 

caatggacca 

ctggaagcat 

tccacatgcc 

tgaggggctc 

ctgagcaagc 


accttagaga 

cctgacctgg 

gacggatggt 

tggtgaggag 

tcctggcctt 

cttcgcagcc 

aggcgaagga 

ctgttgttga 

acccactgta 

atgccttggc 

aggtccacaa 

tgcaggctgt 

aggtagaggg 

tgtcggttgt 

gcaatgaaag 

ccctgctgga 

atgccctccg 

tagcacacac 

agcagcaatt 

agaggaggtg 

ttggctcaca 

ggggccaagg 

catccgaaca 

aggcgagtcg 

agcaaggaat 

attcctcaaa 

tctttcacac 

gagttaaaca 

tgagtgacac 

aggctgcatg 

tgtggctgta 

ctctaattgg 

gtgtccagcc 

acgattgtca 

cgtcaggaag 

ccacgaccta 

gactcttggg 

tagctgtgct 

gtcccaggcc 

gccccccccc 

gttatgggct 

agaatgtttc 

agttgtgatt 

ttctgggccg 

acagcatgca 

ttactctgtc 

tcctagggat 

ccaccagcac 

gcttgcaccc 

ggaaagcctg 

atgagacaca 

ccggcctgca 

atttgtacag 

ttctcccacc 

tggggagatt 

ggaaaaaaaa 

cataccgtgg 

tccctctaaa 

ggctcagttc 

caaggacctt 

actaatagga 

agggatgtta 

agccactctg 

tgtcctagtc 


tctggttccg 

agccaggcgg 

gctccttgcg 

acactggtcc 

gccgcagctc 

attcccacag 

actccaaggc 

agagaaagag 

gctggttctc 

ccaggctcca 

agatgtcatc 

gcaggccacc 

cgtgaagctt 

ctccaaggtc 

tgatgttgtt 

ggaaggtgat 

ggatggcagc 

agtctcgagg 

ccacacagca 

gagatagatg 

gaggttctgt 

cccacattat 

cttcttcaga 

gcctccggct 

gactctaaca 

ataaagaaag 

tgtggtacaa 

cagttcacag 

cagggctgac 

ctcctctccc 

aaaccgggcc 

gcattaatta 

tgtggtcggg 

ttacctgcct 

atggggacag 

tgtcgcctca 

agagccaggg 

cagagcaagt 

cagggactgc 

cattcccaac 

cccagagagt 

tagcacataa 

tcagagcaat 

ggccatgccc 

ggctataacc 

gatcttcctc 

cctgtgctgc 

tcacactggc 

tctcttccag 

gactctaggt 

gcagagcctt 

gctgcaggca 

ctcatcggag 

tactgtggct 

acccccctta 

aatcaaactg 

caagtggagg 

agggcaatgg 

ccctgttatc 

gtccttgtag 

tgtaggctca 

accttccaag 

catccatgtc 

agccctggcc 


gttcctgtgg 

atggccatgt 

ggcagccctg 

tgtgcagttc 

gggggtcgcg 

ggaacgtgcc 

caccaggggg 

ggtggttagc 

atgcagcagc 

tagcttgtta 

ttggaggtac 

aaagatgcct 

caccaggcct 

tagctcctcc 

ggagtagatc 

gcgattgttc 

aaagttgtgt 

acaaccacca 

ccctggtggg 

gggaacagag 

cctctgtatt 

ctcctcacct 

ctgtcagccc 

ctgggtccgg 

aagcaaatct 

cattgagtcc 

aaacttcaaa 

ggctgtgtgt 

gcacgaggct 

ccattcacac 

ttgcctcttg 

ggcattcgtt 

taataggcct 

ttttccaggg 

gctccaggcc 

gttggcctct 

cccctggggg 

ggaaggagca 

ccaggccccg 

agaaacagca 

ggcagctgct 

aaatcatcgc 

tatgacctca 

tgcaggccct 

atctctgcct 

ctcagcatcc 

agagggaggc 

tgcacagtcc 

gaaggccatt 

tcaaggacat 

cttcactggg 

ggtaggggtt 

tttacagagg 

gctctgtctc 

cataggggcc 

aaccaaacct 

aggcaggcgg 

ggggacaaag 

tcaaggctat 

gacagtcact 

ggacagctgg 

cctccatctc 

tttgaaagcc 

acttctggac 


ctggtgcagt 

gggtggcctt 

tcagaggtgg 

cggaagtcct 

caggggacag 

cggcagccac 

gccagacagt 

ctgtggaggt 

aaccggtcca 

ccatggagaa 

tcgatatggt 

gcgggcaggg 

tggaaggtct 

agatgcacaa 

cagagggtga 

tgcaggaaga 

gcctggcagc 

cccagaggta 

gagagacaga 

ggtggaatgg 

gtatgcaggg 

ctttagctct 

caccgcaggg 

ctctgccact 

ggagtcctga 

atggtaccaa 

cctacaactc 

acagacaagg 

tcccttgggg 

ccccccctcc 

actgtccaga 

aatggatcct 

atgctcatta 

tcacagctgc 

tacgggcgcc 

tgccccttct 

aatatgagct 

gggaccttct 

gcagagtagg 

tcgtaagttg 

tctcgtccct 

atataattta 

gcagcagggg 

ctgcagtgct 

caatttcctg 

tgggtgtggc 

acagtcggag 

acagacccac 

cttgccagaa 

tcatgatgct 

gggtcctgtg 

gcagtgggct 

gcttttgttg 

tgtgcacccc 

tccaaggaat 

caggtgtgga 

cagaacggca 

ggactctagc 

catcactagg 

tcctgcagtg 

tctctgtcct 

ttctaatggg 

agtggtatta 

tccttgcctg 


tcttgcctgc 

tgtccctgga 

taagcgtgtg 

ccaccctcag 

cagagctaga 

agtcccaagc 

caccctgcag 

catggaaagc 

ggttcaccag 

acaagtgact 

tgtcctgcaa 

cgctcagtcc 

cgggtgccag 

agccctcgaa 

ccatggcggg 

tgcgctcact 

tgacagtcat 

gctctccagc 

acagcagtga 

gggactggca 

gtcccttgga 

gtccctaaag 

tggaacacac 

cgctcactgt 

atgatcactt 

agcatgcctc 

ctccatggct 

cacatttctg 

ttcacagtac 

tgtgctggag 

gcatttcctc 

taaaataatt 

tggaagccgc 

cccaagtggc 

caccctgaag 

tttccagctt 

gagctgaatc 

gaccaggctt 

ttgccacctt 

acagctccca 

gtaatcaccc 

gtttttgcat 

tggcacagca 

ttctgcccac 

ctccaaaacg 

ctccagcctg 

ggaggggagc 

agctccaacc 

cctttcccaa 

tgccccacat 

aaaatgaaag 

tatcactaat 

taccctcaac 

aaaagaatct 

acaggctaca 

cttagtaacc 

tgaaggtagg 

caagacctga 

aggcctaagg 

aagtgctctt 

ttagtatttt 

gggggggtgt 

ctccaggacc 

agtcagtagg 


155940 

156000 

156060 

156120 

156180 

156240 

156300 

156360 

156420 

156480 

156540 

156600 

156660 

156720 

156780 

156840 

156900 

156960 

157020 

157080 
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tgccaatcct 

atggcgcctt 

attaatgtac 

aggaatccaa 

cttgggcagg 

cttcccctcc 

tcccacgcac 

acttgccccc 

atcatgcccc 

gaaattaaga 

cttggtcact 

tttgaactag 

gctcagacct 

tgggtcagct 

catggtccca 

atgtgcaaag 

agcctccaag 

accagaacca 

agggg cc ctg 

gtctttaatg 

agcatcagtg 

ccctgccttc 

tcgaaatggg 

agcacccaag 

ggtcagtggg 

ggccttcagt 

tgttaaccga 

°tggggctgc 

ctccttcctg 

ctcagcctcc 

cagggctgtg 

agcctcgttt 

cccacagata 

ccaaaccaag 

taaagatggt 

ctctggggaa 

actcgctatt 

gccctgcctc 

agtcaggaag 

ccccaggtct 

ccgtccctcc 

tacctggacc 

agcgatttgc 

acagcacgac 

tgctccctgc 

gtctctggag 

tcatagtagg 

ggagaacaga 

ccgtttgttg 

aaatagactg 

atgctaaaga 

cagaaatgtc 

aggtgcttgg 

aagtgagttc 

taaaaatctt 

acacacacac 

gggctgaact 

tgtctgtgta 

tggatggaga 

actcacatct 

cccaggccca 

gcccagcatc 

agcaccccag 

cccaggccca 


aggattgtca 

ctatgccccc 

caacaatgag 

gtctctgtgg 

gctgccttct 

aggaggtgcc 

gtgagctccc 

tagtctagga 

atttgcatat 

gaggggccta 

gatcccctct 

ggctgagaag 

cagcacacac 

ggcaagcccc 

tgggaccaga 

gcactggggt 

gaatactgta 

gcatcaacag 

tagcaccctg 

aaaagccctt 

ggctggttgc 

aggctggggc 

ctgggtcagc 

ggggctgccc 

tctctctgaa 

cctccctccc 

atgctctctt 

tggcgcctca 

gctcggccct 

actgctgtgc 

gagccagagg 

tcctcacctg 

ctggtaccca 

ctcttctctt 

ccctttacat 

attctgctgg 

gctaccattc 

ctcaccccta 

ggctgctcct 

ttctcccagc 

cacatgccag 

ggttcccagt 

aaggagcttc 

ataagtcact 

acacagtagg 

aattgaatga 

tgctcgataa 

cacctgaact 

gatgagcact 

ggactgggga 

aaaaaaaaag 

tgggtttatg 

gaggcaaaga 

taggacagcc 

agtgggggag 

acgttagtat 

cagagtgtta 

gatgtgtacc 

catggttgtg 

acctctggag 

gcaccccgac 

ccacgcccag 

acccagcacc 

gcaccccagg 


ccagcaggtt 
tctatgaggc 
gatggagcct 
aggggccctg 
acttgagggg 
cttcatcgtg 
aaggggctct 
gagccatgga 
gcggtggcaa 
aggaaagcct 
gtaacaccag 
ggtctgctct 
tcaacagaag 
aaactcttcc 
cctggtagga 
gggaaagggc 
acctcagctc 
ggcctgtctg 
gccaccagaa 
ccggaagcca 
agctggagaa 
tctagccagc 
tttttcaggc 
ctctacccct 
gctttgtagg 
tgcccccag c 
gctgtgggga 
ggccacttgg 
gaagacaaga 
agagcggccg 
cctaagttga 
caaaatgggg 
cctgcccttc 
cctctccggc 
cagctgcctg 
ctagatttta 
acccagtagc 
cacctccttc 
tagtctgcat 
ccagaggccc 
ccctcagcct 
gtgtctgcag 
tttcctggga 
tgtttccttt 
tgctcattaa 
tgacactcag 
atgttggttg 
tggctcacat 
ctgagaagga 
gtttcccaag 
ttagtcactc 
aagagaggaa 
cagatggagc 
aaggctacat 
cagtcaagga 
aatatcatac 
gtgtgtgcta 
tgaggtgttt 
tttgctaggc 
actgagagtg 
ccagcatccc 
catcccaggc 
ccaggcccag 
tccaagcact 


tcttcctagg 

ccttgggagc 

ttgccatgca 

cacctcttct 

gcggaaggga 

ttctgcttcg 

acagcctccg 

agacagtgtg 

taccctggtg 

ctgctatccc 

cccttctgca 

gttcagctgc 

gcatgcaagc 

tgccaagctg 

taggtggcaa 

ctgtgcttct 

agctgggctc 

ctaaacccag 

ctaacgagga 

catttgcaca 

cacagcaggg 

tccctgatgc 

tccaaagggt 

aagtagaggc 

ctgcttcttg 

ctggccagct 

tggcggcctc 

cacttgtgcc 

aagctggcca 

gcagcctcct 

atcctggctg 

cagtcattgg 

ccacatctcc 

ctcctccgtt 

ggaggaacac 

gaatgtatcc 

ctcctggatg 

aacaggaact 

cctccagagt 

tgcctctccc 

ccctcctgcc 

ggctgcttcc 

cattttactc 

atgtccgctg 

cagttctggg 

gaagccctag 

taattagtcc 

caagactcct 

cccaatacca 

gtatgggatt 

taaaaggggc 

gccagatatg 

tctgtgagtt 

agagaaaccc 

ggaaacacac 

tatggctctg 

gtggatattt 

atgtgtacac 

atttgcatgt 

acaacccagg 

aggcccagca 

ccagcacccc 

catcccaggc 

caatgcccag 


gaggcaagca 
cccgccccac 
ttttaacatt 
gccagactca 
gaagacccag 
ttactctcaa 
tcattccttc 
ggaagggctt 
ggtaccagga 
tgggctacca 
acctgccaga 
cttggctggg 
aagggagcta 
agcatgaaaa 
ggctaaggca 
caccccctct 
gggtggccag 
acctcacaag 
agatctgacg 
gaagaaaatg 
ggacaggtcc 
ctggagtagg 
caggacagct 
atccccatgg 
gccatgtagc 
gctcttctct 
aggccaggcc 
acttgtgttc 
gggaacagct 
atccatggct 
ctccttttaa 
aagctcagcc 
atctgtctaa 
cacaacttct 
ctcagaatca 
agtatctatc 
ccctcc cccc 
agggtagtcc 
gcccatctaa 
tggctacaca 
tttgtccatg 
cagagaagcc 
tacagcacgc 
ccactttgtt 
ggagggaatt 
gctcaagcct 
tgggagactc 
taggcccatc 
gtcctttcct 
ttcagcacta 
caggaccaaa 
gtggggcaca 
tgaggccagc 
acttgacttg 
acacacacac 
tgcctgcagt 
gagctctgta 
aggtgttggc 
gtctgagttc 
gcccttttat 
ctccagaccc 
agacccagca 
ccagcacccc 
caccccgacc 


ctgtatcacc 

tgattgcctg 

gcaaattagc 

tcaagcgcct 

ttccactctc 

gcctccggcc 

ttccattcat 

gacaatgagc 

gagtataggg 

gtcagcattg 

gtttttgacc 

aggggaatct 

gcagtggcct 

gccacctcac 

gcggaatagc 

aatggtgcag 

agagcttggc 

ccagtttagt 

ctgggaatat 

aggtgcccag 

taccaagcta 

taaagcagcc 

gctgcagctt 

cccctgggca 

caatctctct 

gagcaatcga 

agctgcactc 

taaacacagg 

gggctcccat 

gtgagtagaa 

tgcttgcagg 

agtcctccag 

ctgaaaccat 

ccatcttggg 

ccgtggctca 

cacttatcat 

ccccccgcaa 

agggaaagtg 

ctagaagctg 

gaccttgctg 

ctgttccatc 

actcttgagc 

gacctcttgg 

ctgatgagtg 

accttctcaa 

ggggcctgga 

agagccttca 

aaggaagtga 

gggcaagggg 

aactggaata 

acctttcaaa 

tctttaatcc 

ctattctaca 

ccacccaaat 

acacacacac 

ccaggaatga 

tttatgtgca 

ctgttgcata 

atgcacataa 

cctgcagcac 

agcaccccag 

tcccaggccc 

gacccagcat 

cagcatccca 
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ggcccagcac 

cagcatccca 

atcccaggga 

cagggacagc 

tggaggcagc 

cctcattgct 

tcccagtacc 

gtctggggtt 

cagcacatcc 

tggagcctac 

tgggagtcat 

gtatgggaat 

ctgcacggga 

aactcaggat 

ccagggctaa 

tctgtcatcc 

ctacttctcc 

aatcttctgc 

tcccttttac 

tttctncncn 

nnnnnnnnnn 

aaattgaact 

ccttgtgatc 

tagtggttct 

aggcattgct 

tcttaagatg 

agcatgtgct 

acaaaaccaa 

attgggacac 

gaggatgaaa 

aaaatatata 

ttgactgtat 

cccaaaattc 

caggaatgct 

acacttccag 

atgtccttgg 

aagtatttat 

ctaggcctag 

cagcctttag 

tgattcagtc 

gtcttttctc 

tctaatctct 

ttctctcttc 

tgtgccgctc 

ctgtgctgca 

ctctcttctc 

tttgtctgcc 

tttaccttca 

tgagagtaaa 

tctccataaa 

ctgccatgtg 

aataagcatt 

cgactacatt 

agtgaatttt 

ttatattaag 

tttcttcata 

cttatggaaa 

cctcccatct 

tgaactcctc 

gagataaggg 

catgacccct 

ctagtgatgt 

agcctatgaa 

cgaccccaga 


cccaggccca 
ggcccagcat 
cagcacccca 
accccaggcc 
acatactgaa 
ttcagcacct 
aggattcccc 
ggcagcccca 
tgcccattag 
cc tgtatagc 
ttcacctagg 
aggtaaggag 
gattgtgtca 
ggtccagaat 
gacccttcct 
ccatctgtgc 
cagcatccca 
taggtttgag 
caccaccact 
nnnnnnnnnn 
nnnnnnnnnn 
tacagaggaa 
agaaagctat 
ggggactgaa 
tatgctaagc 
cctataacca 
caaaaacaca 
acaacaacaa 
atttatacta 
agcccctgca 
gtgtatttat 
atctttgaaa 
ctgagttgga 
ctgatctaat 
gaggcagaaa 
agaggctacc 
ttgactgatg 
ccctggaagc 
gacttgctgc 
cccctgttcc 
ttggcctctg 
tgtctccttc 
catccttctc 
tactctccct 
ctctcttctc 
ctgagggttg 
actcaattag 
ttgtttcaaa 
gatgtgtgct 
taacagaatc 
gtgaagagaa 
ttcttgggag 
ttgccatgcc 
aaggcagcca 
aattgggaac 
tacaacttag 
ctgaagaata 
agagattgtt 
accctagagt 
cctcctaaaa 
tagttacgta 
aaacttgtac 
aaatttggct 
gctctggtct 


gcatccaagg 

cccagggaca 

ggcccagtat 

cagtatccca 

gatagggaat 

gctctcctca 

tcagctgagg 

tgctaattgg 

gcctaacctt 

tctgtattga 

caactccaga 

agcatagtga 

ctgtggttcc 

ccagagctct 

tggacggttt 

ctaacagctc 

tgcccgcacc 

ggccccaggt 

aatcctcttc 

nnnnnnnnnn 

nnnnnnnnnn 

acacaaatga 

agattaaacc 

cccaaggaag 

tgacaaaggc 

tgggtggagc 

cataaagatt 

aaagtactta 

ttctcgttct 

gaacaacgca 

gtaaaacata 

agaaccagca 

ggttcaatgc 

caatggattg 

caaggtgagg 

tcatccttga 

taactgtcat 

ttctagcttc 

tgagatcacc 

gggctcaaac 

aattgctctg 

acactctctt 

tgtaaagctc 

ctcagctcta 

cggtaccacc 

ggcagatcct 

acatcacttt 

ttaaaggtga 

aataaggctg 

ttagggttca 

tgcagagagt 

ttggattaga 

ttgagactgt 

aggctgtggt 

aaaaaagcag 

tttttatagt 

aaaattttta 

cccggaacac 

tcgaaccctc 

caacctcaaa 

gattcccttg 

tttccctgcc 

ggtcgtcgat 

atgttccatg 


cccagcatcc 

gcaccccagg 

cccagggaca 

gggacagcac 

gtctctgagg 

cactcggaat 

aatgggtagc 

actgacagtt 

gcctgcaggg 

ggcactcccc 

tgggcacaaa 

gcctggctgg 

agactgccaa 

taagagagca 

cttggctttc 

tctgtggaaa 

ccagctcact 

tgaggctgtg 

ctcctcttcc 

nnnnnnnnnn 

nnnnnnnnnn 

gctgtaatga 

ccagcccttt 

gacagctgct 

tttctgaatg 

aattctctaa 

gtttactgaa 

atagctcatc 

atcacacgac 

cacaatgaaa 

cataaacaca 

gagtgctatg 

agagttcata 

accaactgat 

cctaactgga 

tccttccctt 

cttggaggct 

catacaatct 

gtttcctgtt 

tcctctcccc 

cttggtctca 

gcttgttctg 

tcccggtaaa 

ctgcactgct 

tgtgtctccc 

atcctgtcaa 

caagcaggag 

gtactaaggg 

agccaactct 

caatacgatc 

taaggatgtg 

gtcattcctg 

ggagggctga 

tggactgtta 

agtagaaagg 

tagcttttta 

ctgaactctt 

tcctgaactc 

ccaactaaaa 

atgaaccggg 

gcagaacccc 

cagctctccc 

tctcctctac 

tgctttcttg 


cagggacagc 

cccagcatcc 

gcaccccagg 

cccaggcaca 

cctcttatct 

caaacaccct 

taccattttg 

tctcctgaga 

gtgtgctgta 

caagctatga 

aatctctcca 

gcacctgaga 

gacatcttgg 

gatgctgaga 

tactctgtcc 

acatgaggcg 

gtgtgccctc 

ggtttccctc 

tcctcttcct 

nnnnnnnnnn 

nnnnnnnnna 

cagaaaagga 

ttgcctttgt 

tggaatgttg 

catatggtag 

gagtctgtct 

gcagtatttg 

agtaggaaaa 

tattaagaag 

ttgaatagag 

taaaaagaat 

gtttataaat 

ctcttaggaa 

agattaataa 

ggaagtttgt 

tctccgcttc 

tactggctcc 

aatccaagcc 

ctttctgaac 

gatgatttta 

aactaactct 

tctttactgt 

cctgcctcct 

ctccacagct 

ttacgtagct 

acctttctct 

tgcctcctct 

tgtgtctctt 

agctagaaat 

aaatatcctg 

ctcttgaggt 

tgacactgtg 

gcgatggatt 

ctagctacgt 

acagttttgg 

tgttgctatg 

cttcacccca 

ttcaccccag 

actgttccaa 

tacattgcca 

ttgtcccttg 

cccttgagtt 

accactaggt 

ctgttgttct 


accccaggcc 

caggtccagc 

cccagtatcc 

gtatcccaca 

tggtccttac 

gtgcaggttc 

gcttttgtct 

gcaatttggg 

ggggcaggga 

cccatgccag 

ataagggtag 

cctgagcagc 

ctttcacccc 

ggcacttaac 

tctgtcccag 

tatgagctct 

atgttactca 

catctgtccc 

cctcctcctc 

nnnnnnnnnn 

caaaggacgc 

atactatttc 

ttttcatatt 

gagacctctc 

tatctattca 

gaacttctta 

caacagtgaa 

agaacaacaa 

attaaggtgt 

aataaagaaa 

agaaaactct 

gaaaagttcc 

gtaactaaat 

tctgaaggca 

caccagaggc 

ctggctgtaa 

atcagctaac 

tagaatgttc 

tctagctggc 

ttcacaatct 

agcaatcttt 

gtctagtttg 

cctccccctc 

ctcctgtatc 

tccctttcct 

gattcttcac 

acaaaccaac 

tttcagccag 

agtttctttt 

agacagctgg 

ttcagatggg 

acaaaggact 

aatacactgg 

ttagctggat 

cagaaggagg 

accaaaatac 

gaacccgacc 

aatgctttcc 

gaacattttt 

aataatagga 

acagaacccc 

ttactatata 

gcatgagttt 

attaaatctt 
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gccttctaca 

tgagtgcttg 

acagcgcgac 

ggtctgatgt 

gctcagtgag 

tgtccaccgt 

cctggtggac 

tcccacctcg 

agaccgtggg 

ctcgtgcaga 

tgtggtcgac 

ctcttgttgt 

tgactctgga 

gaaagggtcc 

cacctgaggg 

aggaaggggg 

tcccacctcc 

cccagtctga 

ttgacgacct 

ctgcagctgc 

ctcggagtcg 

tgccactcag 

agtactggcc 

cagagaaccc 

cttgggatga 

tcctcataga 

acctcgtgga 

gtaggggacg 

gacggcccac 

cagtcttcct 

cagaggggca 

agaaaaagct 

aagcagaaaa 

aagaaataga 

ttttggccgc 

acagggcagt 

attgcaaaga 

tcctaaccct 

taactttgtc 

cagtactcac 

ctggtagtaa 

tgacccactc 

taaccaaact 

aggcccctgt 

aacccaaaaa 

aagcaggaat 

ccacccccat 

ctcatatcca 

caccacttct 

gggaggttaa 

taagctctct 

tttgcctgcg 

agggcggaca 

ccaccctgtt 

agcttaccct 

gtcaccaggg 

ctaaaaaggc 

gcaaaagatg 

ccccacggca 

gctttgcaac 

ggaaagaaga 

cgctagcatt 

tagccagggg 

caaaaaaatt 


ttttgagtac 

agtgagggtc 

cacccagagg 

ttgtgttctg 

accgcgctcc 

ccgttcaccc 

ccctttggag 

tgcccagttg 

ttcaagtccc 

gggtctcaat 

gcagagtcgc 

gtctactgtt 

gcattggaag 

gtggcagacc 

tgcttttgac 

tcaccctgat 

gtgggtcaag 

tgcagccgga 

cctctggata 

cccaccacag 

ccgaggccga 

agcacatgtg 

tttttcctct 

ctctggactt 

ttgtcagcag 

ggcgagaaaa 

cgaggctttc 

cctccttgtc 

caatttggct 

tgagcgtcta 

gagagccgct 

gcaaaggctg 

agtctatcat 

ggagagggaa 

agttgtgaat 

gaaaccgcaa 

gaaaggacac 

agaagatgat 

cgtggagggg 

taacccccta 

attttacccc 

ctttctggtg 

aaaggctcaa 

tgcctgcctt 

tgcagtctct 

ggggttggct 

ttcggtaaaa 

gaggttgctg 

gccggttcaa 

caaaagggtc 

cccacctgag 

tttgcaccct 

gactggtcaa 

tgacgaggcc 

actacagtat 

aactgagagg 

acaaatctgt 

gctcacagag 

ggtacgtgag 

cctagcagca 

acaccaaaga 

accagacttg 

agtattgaca 

agatcctgtt 


ggtctcagtg 

tcccttcggg 

tcctagaccc 

tttctaagtt 

gagagggaac 

tgggagacgt 

gccaagagac 

cgagatcgtg 

acctcgcgtt 

cggccggcct 

cgccgtttct 

tttctagaaa 

gaggtgcggg 

ttttgcgcct 

ttgtcactaa 

cagatcccct 

ccttggaccc 

aagtctggcc 

gactcccaac 

ggaccaatag 

agccccgggg 

agagggccag 

tctgatttat 

actgggctcc 

cttttgcagg 

aatgttctgg 

cccttgaacc 

tatcgccgga 

aaggtaagag 

atggaggcat 

gtagccatgg 

gaggggctcc 

aagagggaaa 

aatagacggg 

gatagacagt 

ggtggcaaaa 

tgggctagag 

tagggaagtc 

actcccgtca 

ggcaagctag 

tggacgacca 

atacctgagt 

gtccaattta 

gtcctcaaca 

tcaggttggc 

aaacaagtgc 

caatacccca 

ggccaaggag 

aaaccaggga 

ctggacattc 

agaacatggt 

aagagtcagc 

ctaacttgga 

ctccatcggg 

gtagatgatc 

ctcctcacag 

caaactgagg 

gcccggaaaa 

tttctgggga 

cctctatatc 

gcttttgagg 

actaagcctt 

caagcactgg 

gctagtggat 


tcttcttggg 

ggtctttcat 

acttagaggt 

tggtgcgatc 

gcggggtgga 

cccaggaaaa 

catttggggt 

ggttcgagtc 

tggtcacgag 

tagaaaggcc 

ggtttctttt 

tgggacaatc 

tcagagccca 

ccgagtggcc 

tcgctgccgt 

acattgtgac 

caaactcttc 

catcagcacc 

ctccccctta 

cgagaggggc 

aggaaggggg 

caccaggacc 

ataattgaaa 

ttgagtcact 

ttctttttac 

gagaggacgg 

gccccaactg 

ctctagtggc 

aggtcttgca 

ataggagata 

ccttcattgg 

aagatcatac 

cagaagaaga 

atcgccgtca 

caggaaaagg 

agataccact 

attgccctaa 

ggggctcaga 

acttcctgat 

gctccaaaaa 

aacgagctct 

gccctgctcc 

cttcagaagg 

cagaaaaaga 

taactgcgtt 

ctccggttgt 

tgagcaagga 

ttttagtggc 

ccaatgacta 

accccacagt 

atacagtcct 

tcctgtttgc 

ctaggctacc 

atcttgcgcc 

tcttggtcgc 

aactgagtga 

taaccttcct 

agactgttat 

ctgctggctt 

ctttgactaa 

ctatcaagtc 

tcgtcctata 

gaccctgaaa 

ggcccacatg 


tccgcggctg 
tttggtgcat 
aaggttcttt 
gcagtttcgg 
taaggataga 
acaggggagg 
tgcgagatcg 
ccacctcgcg 
atcgtgggtt 
atctgattct 
ttgtcttagt 
tgtgtccact 
caaccagtcg 
aacgtttaga 
caggcgaatt 
ctggcagaat 
gaaactgacg 
ccccaagatc 
ccccc tgccc 
tcagggaccg 
gccagactca 
taatgatctc 
aactaaccac 
tatgttctcc 
aacagaagaa 
cacacccact 
ggactacaac 
aggtctcaga 
ggggcagact 
cacccctttt 
tcagtccgct 
gctccaagat 
gaggcaggag 
ggagagaaat 
taaaataggg 
ggaaaaagac 
aaagcgggag 
ccccctccct 
agacaccgga 
gaccatggta 
tcagatagac 
cctcttgggg 
cccacaagta 
gtaccggttg 
ccccaatgtc 
ggtagaactt 
agctagaaaa 
ctgtcagtcc 
tcgcccggta 
cccgaacccg 
agacttaaaa 
ttttaaatgg 
acaggggttc 
ttttcgcgct 
ggcggcctcg 
cttggggtat 
ggggtatacc 
gatgatccca 
ttgtagactc 
ggaaggggtt 
gtctctaatg 
tgtggacgag 
aagacctgta 
tctgaaagct 


tcccggggct 

tggccgggaa 

gttctgtttt 

ttttgcggat 

cgtgtccagg 

accagggacg 

tgggtttgag 

ttttgttgcg 

cgagtcccac 

ttgagttgct 

ctcgtgtccg 

cccctttctc 

gtggaagtca 

gtaggctggc 

gtttttcagg 

ctcgtccaat 

gtcgcggttg 

tatccagaga 

caacagccac 

gcgggggaga 

acagttgcct 

attcctttac 

cctcccttct 

catcaaccca 

agaaaaagaa 

gccctcccta 

accgcggaag 

ggagccgcta 

gaaccaccct 

gaccccttgt 

cccgacatta 

ttagtaaaag 

agagagaaga 

ctgagtaaaa 

ctcctgggca 

caatgcgcct 

cgatccaagg 

gagcctaggg 

gcagaacatt 

attggagcca 

aaaaatatag 

cgcgatctgc 

agctggggaa 

catgaagaac 

tgggcagaac 

aaagctgatg 

ggcatccggc 

ccctggaata 

caagacctcc 

tacaatttat 

gatgccttct 

agggacccag 

aaaaattccc 

cgaaaccctc 

aaggagctgt 

cgagtttcgg 

ctccgagggg 

tcgccaacta 

tggattccag 

cctttcaagt 

actgccccca 

agagcgggtg 

gcctatttgt 

attgcagcag 
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tagccctgct 

cccctcatgc 

cgaatgacac 

atcctcaacc 

atggacatcc 

agagctccca 

ggagctgcgg 

tcggcacaaa 

atcgttaata 

atctacaggc 

attctggccc 

ggccacccaa 

aaaacaagtt 

tgaggatggg 

tttttgaaag 

accaattcac 

aagtccccaa 

tcactaatgc 

gagtgtattg 

tgttagtatt 

ctgcccagat 

aggtaatcgg 

ctcagttggg 

tagagaagat 

gaaaagactg 

actccttttg 

gtttccgact 

gtgattagga 

gtaccccacg 

cttgagccac 

gttgacggaa 

gaagaaaacc 

ctgcgccgca 

gtcctgggag 

cctctggatt 

caattgggac 

cccaaatggg 

agctgcacat 

cgggggggca 

ctggaagccc 

gtcaaacgaa 

cagagcaccc 

cgagaacgga 

atacattcca 

ggcagtgaca 

tggctccccc 

taccctccct 

taggaacaga 

ctaaccctaa 

aaggaatagc 

gaaaaaacag 

tacctcagga 

gtcagtatct 

gtgtgtctat 

ccagactctt 

gaaaaagaga 

gagtaggtac 

gcagttatgg 

ttaacttccc 

aaaaaaagag 

ggagtaatca 

agaaaaagcc 

ctcctctcca 

tgcatcctta 


gatcaaagat 

cttagaaagt 

actatcagag 

cagctaccct 

tcgctgaaga 

gttggtacac 

tggtagacgg 

aggctgaact 

tctacactga 

agcgagggct 

tgttagaagc 

aaaggagaaa 

gctcaagggg 

aggtataaca 

aaaaatagaa 

ccacctggga 

cctgaagtca 

aactaaagcc 

ggaggtagac 

tgtagacacc 

tgtggccaag 

gtccgacaat 

catcgattgg 

aaataggacc 

ggtggctctc 

aagttctgta 

ctgaccctgt 

cccagatttg 

ggttccgagt 

ggtggaaggg 

tcgcctcctg 

acgacgacaa 

ggcgccactc 

gtgcttaatg 

tggtggcctg 

ctctcagatc 

atagggagca 

ttttatgttt 

tcagattact 

tcctctaaat 

ggagaaagaa 

tcaggaccat 

aaacaacacc 

ctaagagatc 

ctcgtagggc 

aaaggtcccg 

aggaactaat 

ggaccgtctg 

tatgactcaa 

tcagatcagg 

aaagttgact 

taaagggcac 

ggtgcctccc 

gtctgttttt 

gtatcatgat 

acccgttacc 

aggaaccgct 

atattgatct 

tgtccgaagt 

gactctgtgc 

aagattctat 

aacaaagatg 

ctatagcagg 

ataagttagt 


gctgacaaac 

atcgtgcgac 

cctgctgcta 

tctccctcta 

aactgggacc 

ggacggcagc 

gaaaaaggta 

tatagcgctt 

cagccgatat 

attgacctcg 

catacatgca 

acttggtggc 

ccatgatctt 

taaaagagct 

ttaactcccg 

gttgaaaaaa 

gtggctcaaa 

tacagagaac 

tttactgaag 

ttttcaggat 

aagatccttg 

ggaccagcct 

aaattacact 

ttaaaagaga 

cttcctcttg 

tggaggacct 

cttaccctcc 

ggaccaactg 

tggagacaaa 

accctatttg 

gatccacgcc 

ttggacagtg 

tagacctagg 

aaaaggaaaa 

atctcacgcc 

atactgatct 

catatgggtg 

gccctggtca 

tttgtggtaa 

gggacctaat 

acccctataa 

gcaaagataa 

gtctaagttg 

ctgggttcat 

ccaacaaggt 

actgtaccag 

actctcctca 

gtcagtctag 

tcatgctggt 

acttataata 

ctagcagcag 

ctctgtaatc 

ctagacacag 

aatagttcca 

aatagttctt 

ttaactttgg 

gccttaatta 

tagaactata 

ggtgctgcaa 

tgccttaaaa 

ggctaaactt 

gtttgaaagc 

acctttaatt 

agcttttatt 


tgacaatggg 

agccacctga 

aatgagcgtg 

acaaatgatt 

agaagtgacc 

agtttcctga 

atttgggcaa 

atacaagccc 

gcttttgcta 

gctggtaaag 

cctaaaaagg 

caagggcaac 

aactgaaaaa 

atggtagacc 

aagaaggaat 

tgatgagact 

agattataga 

ctggaaagag 

ttaaacctga 

gggttgaggc 

aagaaatcct 

ttgttgccca 

gtgcttaccg 

ccttgactaa 

cgctcaaaca 

ccccccttaa 

tctttgctta 

aaagcagcct 

gtcttggtca 

gtgttactga 

tcccacgtca 

gcagtcactg 

gaaccttaac 

cattgtatgg 

tgacatctgt 

tagcaaccca 

ttcggggcag 

gggtcagagc 

atggacatgt 

cacggtaaaa 

atatcaagag 

atactgtaac 

gcttaaagga 

tttcacgatc 

ccttataaaa 

ctccaccaac 

taaagcctac 

tccaaggagc 

tatgctatgc 

ctacttcaga 

tttcaggaag 

agacccagaa 

tgtgggcttg 

aagatttctg 

ttttagataa 

cagttctatt 

agaccccccc 

gaacagtcta 

aatagaaggg 

gaagaatgtt 

agagaacgcc 

tggtttaata 

acacttatgc 

agaaaaagga 


acagcaggtg 
cagataagat 
taacctttgc 
ccgtcccagt 
tgactgacca 
tagaggggaa 
gcgctttgcc 
tccgagaggc 
ccgcacacat 
acatta.aaaa 
tagccatcat 
cgaatggcag 
ggtgatccgc 
agtgatcccc 
aaaatttgta 
aattaaaaat 
ctcctgcaaa 
acaacgggga 
aatgtatggt 
atttcccact 
gccaagattt 
ggtaagtcag 
ccctcaaagc 
attagccatt 
cccctggtcg 
tggaagctgg 
ttcatttaaa 
ataccccagg 
gacggcatcg 
caacccctac 
agagggccgc 
acaatcctct 
cctcatgctc 
gcaaccactg 
aagttagcgg 
ccccctgagg 
ttctaccgag 
aaaaggcttc 
gaaacgacag 
cgaggtagtg 
agtgggtgcg 
cccctacgta 
aataggtggg 
agattgacag 
caggggcccc 
tccacagccc 
cttggcttcc 
ttttttagtt 
ctctag cccc 
tcattctcaa 
agggctttgt 
catccagtct 
caataccggt 
cattttggtt 
atttgaacat 
aggattggga 
aatactatga 
taaccaaatt 
aattagactt 
gtttttatgt 
tagatatacg 
agtccccttg 
ttttgcttac 
taaacgcagt 


accgttgtag 

gacaaatgcc 

gccccctgcc 

acatcaatgt 

accctggcct 

gcaaaaggct 

tgaaggaaca 

taaaggtaag 

ccatggggcc 

caaagaaaaa 

ccactgcccc 

acttagtggc 

ccaaaagccc 

tcccatactt 

aaaggactac 

tcccgatacc 

ccatgtgcat 

gaccatcctg 

aacaagtatc 

aaaacagaga 

gaaatcccta 

ggcttggcca 

tcaggacagg 

gagaccggca 

tttcgggctc 

tggaacatta 

ggccctaaaa 

gaccaccgca 

aaccggtagc 

tgcggtaaaa 

cagtcaagat 

taagcttcgt 

caattcaaca 

cagtccatcc 

caggatcccc 

agcggtgtgt 

ctaatcttag 

aacaaaaatg 

gagatgctta 

gctatgataa 

cttttaaaaa 

taaggttcac 

gttggcgagt 

tgagagaccc 

ccagtcgtac 

aacacagtgg 

ccaccgcccc 

ctaaatagaa 

ccttattata 

tgcctttggg 

ctgggccggg 

agcaaaagcg 

ctcactcctt 

cagcttattc 

cgggtccgct 

gtagcagctg 

agaactacgt 

agaagaatct 

attattcctt 

tgaccattca 

taaaagagaa 

gctcaccact 

ttttgggccc 

ccaggttatg 
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gtactaaggc 
tatctcctaa 
cagaacccga 
agaatgcatt 
tagaacattt 
caaataatag 
tgacagaacc 
ttttactata 
gtgcatgagt 
ctattaaatc 
tgtcccgggg 
cagaggaaaa 
agacaagatg 
tgtgtggtgg 
tacctctgtc 
cagataggaa 
ctcaatagga 
gctggccagg 
ctctgtgagt 
aggagcctgt 
tttaatcccc 
attggagcta 
agcttatctc 
cctccacaca 
cagctgcaca 
tattagtttg 
ccttaaaaca 
aagagtactt 
agcaatagcc 
atgagcacca 
ttcagccttc 
aaggctaaac 
ttgtagttat 
ccagtcacct 
ataatggtat 
agccatcgca 
attataaatc 
ccatcctaga 
ggacattgag 
tgttggacta 
gggtgtgtgg 
atttcgaggc 
tgtttcaaca 
taatacatgt 
ttctgctagc 
ggtcctcctg 
ggttttggtc 
catctggcaa 
tgtgagattt 
ttttcatt cc 
tgtcaatcca 
ctagggtggt 
agaaaatggc 
gaccttgctg 
ccttaggagt 
accaaatgtg 
aaaatagatt 
aactggacaa 
cataatgagt 
ctcatgtagt 
tgcacgctgg 
acagaatagc 
gaggaagagt 
gttaattgac 


aacaatatcg 

aagaagtggg 

cccctcccat 

cctgaactcc 

ttgagataag 

gacatgaccc 

ccctagtgat 

taagcctgta 

ttcgacccca 

ttgccttcta 

cttgagtgct 

atgtattctg 

gaacagctca 

acaggaaaca 

cgtcaggccc 

ccaagtatca 

gtgtgttcca 

agagctgcac 

tcaaggcaag 

ttcaaaacaa 

atgacctcaa 

gtgagatggc 

tggggacaca 

tatgcctccc 

tggtcatgca 

agatcgcata 

caaaaatcaa 

tgtagtaagg 

ttccgagtgc 

gggtacactg 

ctaactgctc 

tactgctcag 

gcagaattca 

cttaaaggtt 

tccaccctaa 

ccagcccatg 

tgttggtgag 

ctgaagacca 

cctcatggac 

cccagacaat 

cacacacctc 

ctggtctata 

aaaccaaaac 

gggtaccagg 

ctagtgaagc 

agctttcctt 

ttaagaggtt 

atgtcttgac 

tgctcctcac 

tgagaactgg 

tcacaccaga 

cactttgctg 

taacgactgc 

cagtctgaag 

attgcttgct 

aagaaatatg 

tattacagag 

atgttggtac 

tagtttgctt 

ctatggtctc 

caatgtaagc" 

tctgtgatga 

ctcagtgact 

atggaaagat 


ggtccttcag 

gaatgaagaa 

ctagagattg 

tcaccctaga 

ggcctcctaa 

cttagttacg 

gtaaacttgt 

aaaaatttgg 

gagctctggt 

cattttgagt 

tgagtgaggg 

cctcatgggt 

acctgctggc 

gagagctgcc 

tacatcttaa 

aaccaccagt 

ggatgtggac 

acctttaatt 

cctgggctat 

acaaaaccaa 

atagccaaac 

tcaacatatg 

cataatggga 

acaaataaga 

tgcctataat 

gcaagcagta 

ttaagcaaca 

acaataccaa 

tcagggatgt 

ctctcagagc 

aggcattcag 

catggctgtg 

agagttatag 

ggatgcacca 

tccccaaaga 

gccttgaact 

gtattccttt 

gcagctctcc 

tcagccgcta 

actatgtaag 

tactcccagc 

gagtgaattc 

cgtaaattca 

ggttgaactc 

tgaggcaggt 

tgagagccta 

ctggacttgg 

aaaggagatc 

cactgcagga 

ggaaaactga 

cctaaccaac 

ggctcatcct 

taagaagcag 

tctttaaaat 

gtcacctatc 

gcatcaatac 

tgcaagggaa 

agttcatgaa 

ttttttttct 

aaactcacca 

atgtgccacc 

ttaacttcaa 

gggtgggcta 

caagtccatt 


gaggttgaaa 

taaaaatttt 

ttcccggaac 

gttcgaaccc 

aacaaccgca 

tagattccct 

actttccctg 

ctggtcgtcg 

ctatgttcca 

acggtctcag 

tctcccttcg 

tcagggggtt 

aggagggtgt 

tacagtctta 

aggatctaca 

ttgtagggga 

aaggagaaca 

ccatcactcg 

acaattctag 

accataaaaa 

agctctcctc 

aaagtccttc 

gaggaccaat 

aaatatatat 

ccagcactcc 

ggctagacag 

ataacagtaa 

aaatgttcct 

ataaatactt 

tgcccagaaa 

aggtcacttc 

gtacttggca 

catcatgaaa 

agtgcctttg 

cttctggcca 

cttgatggtc 

gctgataata 

aggagtcctc 

ctagattcgc 

ccaatcccat 

acgcaagagg 

caggccagcc 

ttctatcagc 

aaagtcttca 

acagccggtt 

caaagaactc 

atctgtagct 

acttgtgttt 

ctgcagatct 

ctttgtattt 

tgccaagagt 

catccttggc 

tctttccttc 

gtgtttgtta 

atattctatc 

ccactgcaaa 

aagaaaaaaa 

gaggttctac 

ctttccttcc 

ctgtaaatca 

aggcctggct 

tcatcaactt 

agggcatgct 

gtgagcagca 


actcgctcta 

tactgaactc 

actcctgaac 

tcccaactaa 

aaatgaaccg 

tggcagaacc 

cccagctctc 

attctcctct 

tgtgctttqt 

tgtcttcttg 

ggggtctttc 

tccctcagca 

gggaaaggac 

caggcctacc 

gtttattaaa 

taaaaataca 

cagttgttta 

taagagggaa 

attagccaga 

agcatttctg 

agtccaaacc 

ccaaaaatat 

ttctacaagt 

aataaaaaga 

agaggctgag 

ggctacatag 

caaccacaac 

ttaaggacag 

agaaaacttc 

gttgtttatc 

tgtagtagcc 

ttatcatttt 

gtttccacca 

gggtgataaa 

tctcataatg 

ctgtctcagc 

caagcaaatt 

aatgcagact 

aacctattca 

tttaatacac 

cagaggcagg 

agggctacac 

tctatttcct 

tgtttacgta 

ctttactgct 

tttttttctt 

gtcatatcac 

gctgcagtgt 

attctgcctt 

gggctttgaa 

tctgctgact 

ctgcagttta 

cagaaatttt 

tggtagaata 

aggaagcaga 

aagtgtaaat 

tcagccagtt 

aaaatggctg 

ctttcctttc 

ccttgaactt 

cacacatttg 

gacataacca 

cataagggat 

acacgccctg 


agattagagc 
ttcttca ccc 
tcttcacccc 
aaactgttcc 
ggtacattgc 
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cccccttgag 
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tgctgttgtt 
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aattcaggga 
aagtgttcat 
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gcaggttcat 
gctacatcgt 
aggctttggg 
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aagaagtcta 
gcaggaggat 
tgtaaacctg 
aaaaacccaa 
ttctggaatc 
ccctggagaa 
ctggattcat 
aatgtctaaa 
gtgactggtt 
agttcctgat 
ttatattcaa 
taaaatgctg 
ctgtgtttgg 
cttcaagctt 
ggcccagctg 
gacaagccac 
atattcatct 
cagatctctg 
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tagagaattc 
gcaagtttcc 
ccttgcaaat 
taggtctcca 
agacattcaa 
cccttctggc 
tttagttgac 
tttgtccatt 
tttgttttct 
tccccaggaa 
agtctatcta 
ttttgagttg 
cgtcccattt 
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tcagaattgt 
ggggtgggaa 
cttacaaggt 
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gtttttcaat 
agaatcgtct 
tatcctgatt 
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ttctgaagta 

cgtgtttatt 

aacatgaact 

tatgctaaga 

agctttggaa 

atctgatttt 

agactcagaa 

tgtcattgtt 

ccaggctaga 

tttgagacag 

ctggcctcga 

gtgccaccac 

tgttcctgcc 

tagtaaacca 

cccaaccatt 

tttttcagct 

ggttctgatt 

tgcggtaagc 

ccaggtttgt 

cattggggca 

gatttggcaa 

caacacaaca 

aagagcaccg 

tcacaaccat 

tgtactcaca 

atgtacctca 

cattttctta 

cttgacacag 

ctctggtggc 

agaagcaaag 

tacatgcatg 

tcattaaaaa 

gctttgaaag 

aaccttttga 

taatggcagc 

tagtgttttt 

acctttgtgt 

ggcttacaca 

ccagctcact 

gggttcatgg 

ccatccttag 

ttagtttgat 

aacgagtttt 

agttatgatc 

tattgcctga 

ttgactggtt 

cctccagact 

tgtgatcatg 

gggtggaaag 

agaaatgtca 

tttatatagg 

aactctaagc 

acccttcaag 

tacgcacaag 

acacaactta 

acatgggaag 

nnnnnnnnnn 

nagagagaga 

gctgagtcag 

cgctgtccca 

gctgcctcat 

cagctgggcc 

gtgctttttt 

aatccttgca 


taagaggaga 
ctctgtctgt 
gcaccctgga 
tattttatcg 
ctgaacctaa 
gatggtaaaa 
aagaagttag 
ctgaatgtaa 
atctgaagga 
ggtttctctg 
actaagaaat 
accaggctag 
aatgtaccaa 
gaataactta 
tttggaattc 
tttttcaagt 
agcatttcaa 
agccacacac 
ttctctatcc 
cgtgcaactc 
tgtcaccctc 
cctctcatct 
actgctcttc 
ctgtaatggg 
tatattaaat 
gagtccaaat 
ttgaggttac 
cagacaagca 
acttcctgcc 
gacctactct 
cacatggta c 
gaaaaatcag 
ccaagaagtg 
caaaactaaa 
agtgggatta 
gtgtttattc 
gggtcctaga 
gtaggtggtc 
ggccttcttt 
ccagtgctct 
ctgtggtcct 
catctctctt 
ctacttctgt 
ctcacagtcc 
ccctgctcgg 
ttgttccaat 
gcagggatta 
tggtgctcag 
attccaagaa 
gggacatttc 
ttaacctgaa 
aactaaggaa 
ccccacccaa 
taacatcata 
aaaagagagg 
gttggaggca 
nnnnnnnnnn 
gcctattatg 
aactagccta 
gatttgcctt 
gctaatcagt 
agagccacta 
ttttttttta 
ataggctgca 


aagcttgatg 

tcttgaccgt 

attgcaaggc 

cagcaataga 

ggctgttata 

tcatgtgttt 

gacttgaatt 

gtgcctcagc 

actcctctat 

tgtagccctg 

ctgcctgcct 

gaactcgtct 

atagtcaatt 

tctagccaaa 

tgtgagggga 

tgcaggtcat 

aatgaggcaa 

tcttgcttgg 

taggcctctc 

aaacacggag 

ctcccttctc 

caagaattgc 

tgaaggttct 

atctgattac 

aaataaatct 

gcttcttcct 

tgcttctctg 

gttctttaaa 

atgactgatg 

ccaaagccgt 

acacacatac 

ggctcagcaa 

gatattaatt 

aaatataaca 

gtttcttttt 

atattccaca 

gattgaactc 

aggcttgtat 

gtttaaaaaa 

ttatcttaaa 

aggagaaaaa 

tgaagatata 

tcaacaaact 

ggtgggtcag 

tgcatccaca 

ttttagtata 

atgttcttga 

cactcttgca 

ccagaggaag 

tagacctctg 

a g^9‘9' aaaaa 

tgttgggggg 

gctggttatc 

tggatgggca 

ccatgaattt 

gnnnnnnnnn 

nnnnnnnnnn 

tcgttggttg 

gctgagacac 

tgggggccct 

tgaggttgtg 

a gggg a gtct 

aatgttccct 

agacaggcaa 


agagcaagca 

agatgtgatg 

aaacaaacct 

aatgaaactt 

attcactagg 

gtttctggat 

ccgttttaga 

tgaaaataca 

tttttgtttg 

actgtcctgg 

ctgcttccca 

attacacatt 

gattcctgtt 

gtctgcctat 

atccacagat 

gattcagtgg 

gcagagggca 

aggcttagtc 

tcttaaattc 

tgtgactcct 

cctagggtca 

cattagggct 

gagttcaaat 

ctcttctggt 

aaaaaaaaaa 

cccctgacta 

cttctaccct 

gcagggctca 

atctaaggtt 

cctctgacca 

acagaagtaa 

actttccgtg 

atagttattc 

gagtgtattt 

tagattattg 

gcatgtgtgt 

aagtcatcaa 

ctttggaagg 

agaaaaaaaa 

atcaactgca 

catacagttg 

atcagctcac 

acccaagctg 

aagcctaagc 

ttcacatcct 

tgtgctgctg 

ctgccacaga 

cccaaggctc 

aagaaagtca 

aagtctcaag 

attcttacag 

ggggtagtct 

caaaacaaac 

gattgcattt 

aagagagagc 

nnnnnnnnnn 

nnnnnnnnnn 

cttctaatca 

tgtccaccca 

tgaaatgaaa 

aaaatagccc 

ggtcctttgg 

gctttaggct 

gaatatcatc 


ggcaagcaag 

gctgtcttgg 

tttcctcttc 

agaacaggcc 

atagggacca 

atgatagatt 

gacattccag 

aagagtacag 

cttgtttgtit 

aactcacttt 

agtgctggga 

aacacccctc 

tatttaccac 

tagccatatt 

gctgtagacc 

gtcatgaaat 

tattgtcaca 

agtttctggc 

caaacatagt 

ttccccatct 

gttttacctc 

ggtgagatgg 

cccagcaacc 

gtgtctgaag 

aaaaaaagaa 

cactcacgct 

ggctccttct 

ggaccagtga 

aagcctagaa 

ccatgtgtaa 

aaagagattt 

tagaaaacta 

attatagcag 

tttttgtaat 

ttattatttt 

ggaattggat 

gcttgcacag 

caagcatttt 

agtccttttt 

aacttttatc 

gatggcttta 

atcacactca 

agcagctcca 

gggcgtggct 

ttcctggtga 

aaacaatctt 

gcactaatat 

ggggaacatg 

gaggtgagac 

aacaaggcct 

gggtccaacg 

tccccaggga 

tggtcagtcc 

aggaatacac 

aaagcaaagt 

nnnnnnnnnn 

nnnnnnnnnn 

ttagaaaacc 

ctgtcccaga 

gtcaccagca 

tgcagtggtt 

agcagagtta 

cagtgctgtg 

cctgttttgc 


ccaggatcca 

cttcctggga 

caagttgctt 

cataactgcc 

ctggaagtga 

tatcaatttg 

agaaaactga 

ggaagaaagc 

tggttggttt 

gtagaccagg 

ttaaaggcgt 

tttaattaac 

atgtttctgt 

ttcatcagtt 

gctttagaca 

taatttagtg 

gcacagcaca 

tctaaacgcc 

tagacattac 

gcggttccca 

tcacactcca 

ctcagaggtt 

acatggtggc 

acagctacag 

ttgccattaa 

ggcctgagtc 

gctgcctatc 

gactgatcgg 

cccacgaggt 

actgcacatg 

aaattgaaaa 

gagtacttag 

agatttctaa 

gtaagtttac 

tattaattat 

ttctgcttcc 

taggtggtca 

acttcctgtg 

tgtttaatta 

tggtaaaaag 

tcctgcaggc 

agcctctgcc 

aacaacagcc 

acctcgctgc 

gtgtggttct 

tttgcctctg 

ttactgaaca 

gaggaagagg 

tgcatctcct 

gaaagtctta 

ttagacaaag 

acactcctct 

tgaagccata 

acatacacac 

gggaaggggt 

nnnnnnnnnn 

nnnnnnnnnn 

actctcttag 

gcaaggccat 

ggctccggaa 

cctgggcctg 

acagtcatca 

cgctacttct 

cctcaggcaa 


178980 

179040 

179100 

179160 

179220 

179280 

179340 

179400 

179460 

179520 

179580 

179640 

179700 

179760 

179820 

179880 

179940 

180000 

180060 

180120 

180180 

180240 

180300 

180360 

180420 

180480 

180540 

180600 

180660 

180720 

180780 

180840 

180900 

180960 

181020 

181080 

181140 

181200 

181260 

181320 

181380 

181440 

181500 

181560 

181620 

181680 

181740 

181800 

181860 

181920 

181980 

182040 

182100 

182160 

182220 

182280 

182340 

182400 

182460 

182520 

182580 

182640 

182700 

182760 
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attctgaagt 

attctcactc 

ccacctccca 

gagaatgaag 

tgaacataaa 

gagaaatgtc 

ccccaaacag 

actcatgttc 

gcaccaggag 

tcccacagtg 

cctgggcttg 

gggacgtaga 

aatgccagca 

acagactgag 

agatagtggg 

acaaaaatgg 

tttgtcacct 

gttcccagca 

ctacaaagta 

caatcaatcc 

cttccagagg 

ctccaatccc 

agcatacaat 

gtacagaaag 

tttcctgtct 

tggtgtagtc 

aacaggatca 

ccccggctgt 

caactgagga 

tctggaattt 

gcagaactgc 

gggtctgtgc 

gaaaaaaata 

gaggcaggac 

cctctgatca 

tccattagag 

agtatggtag 

agaagttcag 

gttttgtctt 

tttgetgcca 

atgacgcttg 

actaagtgga 

tcgttgggaa 

gaaagagcca 

ccaaagctgc 

cacaggaggc 

atgaaacgtc 

gggctggcag 

tgcttttcca 

gctggacttt 

ctggctctgc 

gagaggtttg 

ctcaaaggca 

ccaagcccta 

agggggctgg 

attccgttcc 

gaatctgttg 

cgcaggcaaa 

tcatgccagc 

cagcctgagc 

aaaacaaaaa 

caatggatcc 

ttctccatag 

agagttgagg 


ctggcaaatg 

tacttctgcc 

tgctcagggg 

tagaagccag 

gttcagttac 

aagtgctaca 

atacaaacct 

cagcccagtt 

gaaatataat 

actgtggata 

agatctagca 

ggcagaagga 

tttgagaggc 

ttccaagaca 

agagaattca 

agcatgaatg 

acataccaat 

tctgggaggt 

agttctagga 

atcagtcaat 

acctgggttt 

aagggacatg 

atatatatgg 

agatcaacta 

gacatctgac 

cctgtgaccc 

aatcccacag 

ctgtgcagca 

gtgcctcagt 

gtcatttgat 

agaggaaaag 

tttctcttac 

gatgcatcct 

tcatcatgca 

aatcagtttc 

gcctgcatgc 

tatctgttta 

agttatcctt 

aacaaaacga 

agtttgatga 

aactccacgc 

taaatgttaa 

gggaggggaa 

gtgtgccacc 

ccagtgtgcc 

cagcaggagc 

tttccctttc 

ggctcatcca 

aggtcttagt 

actctgctgt 

caaccaccat 

gtctggtcct 

agtgctggtc 

gaggacagac 

agaaatagct 

taacattttc 

ccctctctgg 

acactcatac 

gatgctctca 

tatagaatga 

caaatctgtg 

ctagagctct 

accacatttc 

tttacatggt 


aaagatgtgg 
tgtgccacct 
ctaggaagtg 
cttcaaatct 
atttggttta 
attcagtgga 
tcatcaaagt 
ggctccttct 
agatgtcctt 
cagcccagtt 
ccctaaggca 
tcagttcaag 
agaaacagat 
gccaaggcta 
aggttatctt 
cttgcaactg 
tatacaaatc 
agagatgggc 
cagccaaccc 
catgggctgg 
gattcccagc 
acaacttcta 
gtaaaatgct 
taatgaaaca 
ttcttgcgtg 
aggattaggc 
tgctccactg 
ccgcccagat 
gctcctcagg 
atttccagac 
gggcactact 
aaatcattgc 
cacagtacag 
gcctctttcc 
aggcctggaa 
acgaagctct 
gcatggtgtg 
gcacttacag 
aacaagaggg 
cctgagtttg 
acatgccaca 
aaataaataa 
aaggcaggcg 
aaagctgccc 
accaaagctg 
agccaggttc 
ttcagaccac 
ctctagtgtg 
gtctaattaa 
aaggactttg 
gtcagtgggt 
tgtctaatgc 
cagcatgtgt 
atgctttaaa 
cagcagttaa 
atggtggctc 
tttt ctcagg 
atatgaaaca 
gcactcagat 
gatgctgtct 
ggcttaatca 
gctttgcccc 
tattttgggg 
catttttttt 


gatttgaaca 

tcctctcatg 

gggagaagat 

aaactaagca 

aaaaaaaaaa 

tgtggatgaa 

ctcttccaaa 

catgtgagct 

tttaaggggg 

agtagagttc 

tagtggtaca 

gtcagatgag 

gaatttgtga 

cacagagaaa 

ggactgcata 

tggacaatat 

agattcagct 

agagctctat 

tacagagaga 

agagatagct 

acccacatgg 

ctggtctctt 

atatatataa 

accataacac 

acccagtgcc 

aaccactgcc 

taccccgcca 

gcttggtgtc' 

gcagtgtgca 

tccagctgac 

gtgactgtta 

ggcctctctt 

gatggcaggt 

ctcaactaca 

agaaacggcc 

ggatttgatc 

gaggtatacc 

taagttcaaa 

gccggggaga 

gtcccttgag 

gcccacgcat 

atcaaaggaa 

agggagatag 

agtgtgccac 

cccagacttg 

cacctcagag 

agcagtgaca 

cctgtggcca 

ggttagcagc 

ggcattgttc 

cttcagaggt 

aaaatgcctg 

ggaatcctaa 

aaaaagtcat 

gagcactagc 

aaagctgtct 

caccaagaac 

ttttttataa 

ggcagaggca 

caaaaagaaa 

ttcctagcca 

ctggtctgga 

accactctgg 

gttgttacag 


cagacttgtt 

cacggggagg 

ggatgtcctc 

atgttttatt 

tccctacaca 

acaatcaaaa 

ggctgggtct 

ccgacttcca 

gtggggtctg 

tcgcctagca 

tgcccatgac 

gggtggggag 

gttcaaggcc 

ccctgtcttg 

agactttgat 

tgggttcata 

gggcccactg 

gactttaagg 

tacactactt 

cagtgatcaa 

cagctcacag 

tggtcaacag 

aaatcagatt 

atactgtttt 

tccacaatga 

cctgggaagc 

•gaggacgtga 

actgagtgac 

attgaaactt 

cttggataac 

tcccctgcat 

cacttccctc 

acgaggcggg 

ccgccgcccc 

actcaggctg 

ttcagcacgg 

tatctctgca 

gctagcttgg 

tgcttcgctg 

cttatggtgg 

gaatgtgtat 

tggccactca 

ataaccctga 

caaagctgcc 

attacagatt 

gtggagccac 

gctgtcctgc 

gaacaggcct 

caaattggag 

cattccgtga 

agaagaactc 

gggcaccagg 

gttcaatacc 

gctttaaaaa 

tgctctttca 

ataattcaag 

acatgtggtg 

acctgctcgg 

gggggatttt 

aaaaaaacaa 

gaaggagctg 

gagcttgctc 

gcctgcacat 

taatgagagc 


tggccaaaga 

ggaggggagc 

aaagcagggt 

tccatttccc 

actggttctt 

tgttgaacac 

gaaaagagcg 

aagactgctt 

tctgacaacc 

agcgtgcggc 

cacagcactt 

gcattccttt 

agcctggtct 

tcaggaaaaa 

tccaaaataa 

catattctgt 

gtgcatgttt 

ctagcctggt 

ctaaatcaat 

tagtgccatt 

atgtctgtaa 

gcatgcacgc 

cacaaatcaa 

aaaagctacc 

gacaagcaga 

cattcctagt 

agcctccctt 

catcggagcc 

gcatgtgatt 

agcacagagg 

gattatagaa 

ctttgaagtc 

ttccgggact 

tgcagagttc 

gggatgtggt 

gcataagccc 

cagggagacc 

gcaacatgaa 

tgctgagtca 

aaggagagaa 

acacacacac 

aaatctacca 

tatgaacacg 

cagtgtgcca 

tggccaggga 

aaacctggaa 

agagtctgga 

cagtcacagg 

agagaagggt 

tcaaatacca 

atcctttttt 

ttaatgtcaa 

catcagagcc 

aattctgttg 

gaggacccag 

tcctagaggg 

caaacataca 

atgtggtggc 

gtgttgagcc 

aaaaaaacaa 

gctccagcaa 

tagaaaggaa 

ggaaaaatgg 

tttgaaatga 


182820 

182880 

182940 

183000 

183060 

183120 

183180 

183240 

183300 

183360 

183420 

183480 

183540 

183600 

183660 

183720 

183780 

183840 

183900 

183960 

184020 

184080 

184140 

184200 

184260 

184320 

184380 

184440 

184500 

184560 

184620 

184680 

184740 

184800 

184860 

184920 

184980 

185040 

185100 

185160 

185220 

185280 

185340 

185400 

185460 

185520 

185580 

185640 

185700 

185760 

185820 

185880 

185940 

186000 

186060 

186120 

186180 

186240 

186300 

186360 

186420 

186480 

186540 

186600 
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tcacaaaagg 
aaccgaatcc 
ttatggcaac 
tgcaaagtct 
gtcataaata 
ctaatgagga 
ttctaatata 
taaaatgata 
atccaaagca 
tctcatccgc 
taggacagac 
tattaggctt 
aaaaagcaac 
ttgcataatg 
gaacccggca 
ttgcagggag 
gacctgactt 
ggattgctct 
agcatggatc 
tgcataccag 
aaccacccaa 
ggattcataa 
tacccaaaat 
tttggaaaac 
gcaccaagac 
gccgctcacc 
cacagtcgca 
tttgtgaaac 
gtaattcaaa 
tgcctactga 
taacttccag 
gacccccatc 
agggtcttgt 
gcattattta 
catttcctct 
tatccttgat 
tttctgggtt 
accaacatga 
acaggcatgc 
cagagtaatt 
aatgagttct 
gggtgatgtg 
tggggagctg 
ggtacact cc 
catggcgggt 
aagtttgaca 
ccacaccacc 
caacaagaga 
ttgtgaggaa 
ttagtcggtg 
gaacaccagc 
ccgaattcca 
tggcgctcag 
tatcccctgt 
ctcccctcct 
gaccccctcc 
tttatcaagt 
gtttctattt 
actgagggtc 
accacgcacc 
ctcgcccaaa 
aaaagctacc 
acagggcggg 
tcaatattta 


aaaataggaa 

taaaaggaat 

tgacatttaa 

tgcaggaact 

tagcttattc 

aatatttcca 

atttatcaat 

aatagcaata 

gtgtagctat 

agatacagtc 

gaaggggggc 

tgttataaga 

tacttggggt 

aacaattttc 

caggctcata 

agaaagcagt 

taaacagttt 

caaagactct 

tccggccctt 

gaggactgaa 

actcatttcc 

aataaggcag 

ggaactgcat 

tggcacagct 

ccagggctga 

aatcccggct 

ctatgctctc 

tgttttcgaa 

actaattacc 

ttgtgcgctg 

gctaaatcag 

cctgacttcc 

agttctggct 

atttaggcta 

tcttcaattc 

cgtcccaccc 

tgcaaggtgg 

gcggcgcaca 

acacaccggc 

aggcctaggc 

cttcctttgg 

gctgaaacca 

cagtcggaag 

cagaaataga 

ttttagacat 

gagtgatatt 

gccacccaag 

gggagccggt 

ttgagtgtac 

aagcaggggt 

tcgaggaaca 

cagggcttaa 

caatctcatt 

atctgactca 

cctgcctatg 

actggactct 

gttctgttgg 

cctttacatt 

ctggtagttt 

aagcaaaaat 

gccagattta 

acttggcaga 

atggggcggg 

acctaactgg 


aatatgcctc 

gttcaagtaa 

aagtaacagg 

tcatgctaaa 

ccaaatcctt 

caaaaaccca 

ttaaacacat 

gtggtgaggg 

atttaaatac 

tagactcaga 

aataagaaat 

ttacatcaaa 

ttgcgcctgg 

attcaatcag 

gtttttatct 

cgggaggcat 

ctctctggag 

cggactcctg 

ctcactgctt 

gtttggctcc 

gacttggttg 

aggcagagaa 

tttcatgcac 

gacattggct 

ggcctaatcc 

ggagccgcag 

ccgtggggag 

aacctgggat 

ctgagcacat 

cagctcactc 

gcagccacag 

aggcgaggag 

gccgaattat 

gaatgacctc 

gtggccccca 

cacttggcta 

tcaggtccac 

gccgccatca 

agacatgtgc 

agttcctgaa 

gggtgctaaa 

tttctgcagc 

aggcacaatt 

cagatgaggg 

ttaggaatat 

accgggttta 

ttatcacatg 

ctatctttgt 

aacactccaa 

aactgcgctc 

ccaaaaaggc 

gtattaagcc 

tattatttct 

cattctcctg 

catgctttgc 

ctcaccccac 

tgagccttct 

tcttgtagcg 

tagagttgga 

tgcacccacc 

ttgctcctgc 

agatcaagtc 

tagggtggag 

taatttgctg 


ctaaaaagag 

aaaaaaaaaa 

atttgggcaa 

attatgcttt 

agaatttcta 

cttaccatta 

tgcataatgt 

gtggggggca 

ctcaatccat 

gcagcatatc 

gacaggaaac 

gcagttcata 

gggctgcctc 

gatctcagca 

cccagctcca 

tgtcctagtg 

gttgaaaggg 

ttacaggcaa 

tcttgaatca 

c ggtgaccag 

gtcaaatttt 

acggagggtg 

aatagaaaac 

agaggaagga 

gcctttatcc 

aagagctctc 

gccgctccgg • 

gatcatttaa 

ttgaaacatt 

tggtgtttct 

gcgctgcctc 

gcggcccgtt 

tgctcttatc 

tttccctccc 

ttttctgatt 

catcttcatc 

cagtcatcct 

ctgagagagc 

acatgcgctt 

gcaaattcat 

ccagcatgcc 

caagcctgtg 

c tgggatcaa 

ctgcctcctt 

aaaagtaggt 

actagagcca 

agccataatg 

ccaaaggaaa 

taacatcccc 

gagcagtctg 

cgattaatga 

ccaaagaaat 

ctacaaagat 

ctcagtaagc 

gtcttcacaa 

atacttggaa 

cttgcattaa 

tctcacatgg 

gttagatgac 

cttccctcag 

tgtaaagtgt 

agaagtgcag 

cgaataattg 

tgacaattac 


ccgaggcaaa 

atatggctca 

gtttctttct 

aattttaatt 

ccctgcaagg 

agaaccccca 

gccactctgt 

aaaaccagga 

tgtagaggaa 

cttgac/tgta 

ttcagaagaa 

tgttttaatc 

tgtgtactga 

gagatagctc 

cctgctggag 

gctgtgtacc 

gctctgtaag 

gtaaggtcct 

gggatttaga 

aggacaaggt 

caagtttccc 

tgtgtgtgtg 

ttaaagactg 

acggccaggg 

gagggtttag 

ttcacttggc 

gagggggagc 

atgtttaaaa 

tatgccatca 

ataaactgct 

cagccctggg 

tctccagaga 

cgtgttcata 

gagtcttcct 

ggtccaaata 

tgggagccaa 

aaggtgtgtg 

acgtgccctg 

tccccagcaa 

ttcccccttt 

agtggctaga 

ggcagaagct 

gaaatgagca 

attagcgctt 

tggattccca 

ttaagagact 

caagagaatt 

tgagcagccc 

tgcaggattg 

cctgtgtacc 

caaaggacac 

caaggtctag 

ccaacactca 

catcctggtt 

cgacagctgg 

ctactccttg 

agctgtgaga 

gagacaccca 

ccagcaacat 

atgttcctgg 

atcttctcta 

gctagcaggt 

aagctccaag 

gccatgaagg 


ttattatagc 
tgcgaagttg 
ccaccctctc 
agccaaatag 
agctgaacta 
tccgatattt 
agcattccat 
gattaaacat 
aacacactgt 
agggtattaa 
ataaaatttc 
tggggaggaa 
accagacagt 
ctactcaaag 
aaaccttgta 
taaagttaca 
ataccagagt 
agcagatggt 
aattgctatt 
cattgtttaa 
agcagtctaa 
tgtgtgtgtg 
aaccaatcat 
cgagccagct 
tgaggctccc 
tcagtcccag 
gacatcaagc 
tatgcacatg 
tcttggatcc 
tcagcgattt 
ttggtggaga 
gccgtttgtc 
attctcatct 
ccctcattcc 
tatagacaaa 
tgtggtgagt 
agagaggtag 
cagctcaggc 
accctgcttg 
tccagaataa 
agcctgagat 
aaccttgggc 
ctggtttata 
tgaagatg cc 
cagtcagctg 
cttcattatc 
ttcattccat 
agcgtgaagc 
cctctgcgat 
tggcttgcaa 
tcatagaggc 
gccattctcc 
atttcccagg 
tgaaacgggc 
taatttgcaa 
gaactacttg 
aggaaccaca 
ggttagatat 
gccttccccc 
catcttataa 
agcctcactt 
gacggtgagg 
agttaccagc 
gaacgctgcg 


186660 

186720 

186780 

186840 

186900 

186960 

187020 

187080 

187140 

187200 

187260 

187320 

187380 

187440 

187500 

187560 

187620 

187680 

187740 

187800 

187860 

187920 

187980 

188040 

188100 

188160 

188220 

188280 

188340 

188400 

188460 

188520 

188580 

188640 

188700 

188760 

188820 

188880 

188940 

189000 

189060 

189120 

189180 

189240 

189300 

189360 

189420 

189480 

189540 

189600 

189660 

189720 

189780 

189840 

189900 

189960 

190020 

190080 

190140 

190200 

190260 

190320 

190380 

190440 
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actatgcaag 

ttaataacac 

cagcatatgg 

ttcaattcgt 

catttgactg 

gcttcccaca 

aactttaata 

gagcgtgaga 

tgagacaaaa 

atgggagttt 

ttagtggttc 

tatggagatg 

gtagaagtgg 

tgcaccccaa 

aaattttagg 

tcgagttgag 

ttttttttta 

tggctccctg 

aaccaccgaa 

tacaagctct 

aattaccgac 

ttaaaagttt 

attagggact 

aattatacat 

aggttatgaa 

cagctccaag 

ggatcactta 

ccatcttccc 

gggtctctcc 

tttttttttt 

aactcacttt 

ctcctgagtg 

tgaagttccc 

tccggctcag 

aacccccaca 

gcaaggctac 

tgtcggaggg 

tgtcctttgc 

gccttggtgg 

atgcctccct 

atttacctgc 

agagaccctc 

gcccatcaga 

gtgattcagc 

gccactttta 

gggtatataa 

cctcacgcct 

actaagtgtg 

ggtgcctggt 

gttatgtcac 

tgctctcgta 

gaactgtaag 

aaagtaactt 

ttgtgtctgt 

ccactgacat 

gtcattcaga 

cctagggtac 

ttgtggatta 

acatcaacag 

cttgaacaaa 

tccggccttt 

ttatttcatt 

ctaaggttac 

tgagatgaag 


aatgttgctc 

acagaatgag 

ctgtgcacat 

cctcagatag 

acaggcatct 

gcccggcttt 

aggattttct 

caggaatggg 

gaatattaat 

tgaggcatgg 

atttatggag 

tagcttataa 

tgggggtggg 

gcctatgttt 

ccaatagaaa 

cagtggtgtc 

aagtccactt 

cagcgagagg 

agggaactgc 

gagcaagcta 

tgggtaatcc 

acacaatcac 

ttttgggtgg 

tgagatttgt 

atggccaggt 

tcataagctc 

ggctacccac 

aacatggctt 

aatgtggact 

tntttggttt 

gtagaccagg 

ctggaattaa 

agtgacctgg 

aattttgggg 

ttgggagagg 

acagcaagaa 

tgtgaggcct 

tccattactt 

ctgcaatctg 

ccccaccctc 

cagcctttgg 

acagaagcac 

agaacccagg 

cccataaatt 

ttgaaataga 

gagataatcc 

gcctgcaagg 

gaggtatgga 

tggtggcacc 

tgggtatggg 

aggttcctgc 

ccacttactc 

gtgtggcagc 

ctgtctgtct 

gtctcagtcc 

ctggttttga 

tcgaccagac 

tttgtgtata 

tatttagcat 

tgaaaatcaa 

taatacagtt 

gcttaaccag 

ccaaccacgc 

aacaatcaac 


tctaattaag 

ccttggctcc 

aggccgtgag 

agcccagaga 

tcccggaaag 

caaccaggaa 

cacagaaaga 

gacacaagcc 

atcttggcta 

ctaagctggg 

ggcttgctgc 

aactactcag 

gtggggcaag 

agggttgatc 

tgacctctgc 

aacggagaga 

gcttgggact 

ctccagtttt 

acagcacaca 

tctgaagaag 

actagggagc 

ttttgagttg 

tttactcgag 

caactgctgg 

aaggttgggt 

aaacaagtct 

ggacaagcac 

ctatgttgct 

caaatcatga 

ttcgagacag 

ctggcctcgg 

aggtgtatgc 

atgtcaactg 

ggacaaggta 

cagaggcaga 

gctgtctcaa 

tgggggccag 

aatcagaatc 

catttaagag 

cctgcatccc 

tgggacacag 

ctgaatgtac 

cttatgctaa 

tacacatccg 

aaagatgcct 

catgttgtct 

tagccaccaa 

taaaaaaaaa 

gtttggggag 

ctttgagagt 

caccatgttt 

tcaggtctct 

cagctaagca 

gtcattccat 

tcttggtatc 

gcttgcaatc 

tgggagtagc 

aagatctgat 

caacttaata 

acagcagtac 

cctcgtgttg 

agttaactgg 

taggaaggag 

ataacaggga 


agggctctgc 

gggagctaaa 

tgatgcagcc 

gcgcggctca 

cctgcgcgtg 

ggcttggcgt 

aaagtccatg 

aacaccccaa 

tgaaggagga 

caaaccattt 

ccggagagcc 

attttaaaca 

agaacaattg 

agcttcccga 

accacggctg 

gcagaggaaa 

cacctgaagg 

cccaggcacc 

agttaaatat 

ctgtcatttt 

aggtagtttt 

actataagta 

gctgcaacca 

ggagtagtgg 

gcttccaagt 

tcaaggggcc 

ttctcataca 

tcaacaacca 

ctacagcntg 

ggtttctcca 

actcagaaac 

taccacgccc 

aagttggatt 

ccttgattta 

ggcagaggca 

aaccaaagac 

aacaaggtgg 

gccatcacag 

ctaagtggga 

tccatttacc 

gcttagtggc 

tttcagcgct 

ggagccagaa 

tacagccaaa 

attctggagt 

ttgatgtggt 

gccacaccca 

tgtcccaaga 

gcagtggacc 

ttgtagcttt 

cctctgccat 

ttcagtcctg 

agggctgtgg 

ttatatagtc 

atatattcac 

ctcctgcctc 

aggttctgtt 

ggcccgaccg 

aactcatttg 

ctgttctcct 

cggtgacccc 

aa< 39'gttaat 

aggaaagggc 

cagagcagtc 


atttcctagt 

ggttccatta 

cagttaagcc 

ggccctcacg 

cctacactgc 

gggtctgatc 

ggaacaaatc 

ttgctaggct 

tggtgccatc 

tctttttttt 

catcagaaga 

aacagtgcag 

catctgcaga 

ggcaagccca 

actgaagcta 

gtccaatcag 

cagggcattg 

agcccatcgg 

aggctgggtt 

taatgacggc 

ggaagaacag 

tttcacacga 

acaatgagtg 

agggtcctgg 

ctcaaatata 

tggagagtta 

aggaccggct 

gggcagggtg 

gggttttttt 

tatagccctg 

ccgcctgcct 

ggccgagtcc 

ttactgtgat 

actgggcact 

gaaagttggt 

atctttcttg 

tcaaggaaga 

atatagctag 

taaactcagg 

tgtttccagg 

ttagcgctgc 

gcagagcacg 

agtagaagca 

cccacttgaa 

gctaagtggt 

gctagggaga 

caacctctat 

cctcacgaat 

atttggtctt 

gctccccttc 

tatggacacc 

gagtcttatc 

ccgactgctt 

ctgagaattg 

ttaagacaag 

tgcctcaagg 

ctcttagctt 

actcccttcc 

gtaaagccat 

agagcagcgg 

cccccccccc 

aataaaacca 

cactcgcaca 

cttgtaacaa 


cacccgcact 

ggagcacggg 

cgctaacacc 

ccacgagccc 

aaatggacct 

cttcaagagt 

ctcctcttaa 

aactctgata 

ttctgaattg 

ctcttcttaa 

gagctcgctt 

gaggccagag 

aggctagccc 

gaagcctcta 

taaataagcc 

agcttcattt 

agtagagcct 

ttggttacct 

atzctgcattt 

acaaacttcc 

ttcaccatta 

ggcaggtggg 

ttttctcaag 

taatgcagaa 

ctcctaaggc 

agacaaataa 

acctccaaca 

aattaggggt 

tttttttttt 

gctgtcctgg 

ctgcctctgc 

gtcttgataa 

gactactgag 

acacgactgt 

tggaggctag 

atccaaatcc 

ccactgactc 

gagattttaa 

ggtgggccca 

gatctgctta 

tcggggcacc 

cacggctcag 

gctggcaaga 

gtgatccaga 

acaggagggt 

taacccagga 

ttatacacac 

ctgcaaacat 

gcaggaggaa 

cagttaactc 

tggtcctcta 

atagcaatga 

gggattatgg 

aaccacttta 

atctcattaa 

cgataggatc 

tctacagtga 

ctttaagtga 

ctccccacct 

ctctcagcct 

agccgtagaa 

gtctgggaga 

aacctgtctt 

gtgcaaagga 


190500 

190560 

190620 

190680 

190740 

190800 

190860 

190920 

190980 

191040 

191100 

191160 

191220 

191280 

191340 

191400 

191460 

191520 

191580 

191640 

191700 

191760 

191820 

191880 

191940 

192000 

192060 

192120 

192180 

192240 

192300 

192360 

192420 

192480 

192540 

192600 

192660 

192720 

192780 

192840 

192900 

192960 

193020 

193080 

193140 

193200 

193260 

193320 

193380 

193440 

193500 

193560 

193620 

193680 

193740 

193800 

193860 

193920 

193980 

194040 

194100 

194160 

194220 

194280 
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gagagagagg ctgagtttct acttctataa ataaaccctt ggcaggcgga tcactaaagg 194340 
aacacaagtc aatataaacc tttagacatg gggctgccaa acttcacttt tcgacagtat 194400 
attaattatg tagtcaatag ccatgggttt cattagcgta ttaaatacca cgatcaatat 194460 
tatttatact tttcgaagac aagccactca gggaaaaaat ggtgggggga ggaggaggaa 194520 
caatttgacc ctgtagttca aaaaaagtca gaacagcaca ctagagatta gcaagggttt 194580 
aatggaaggc ataaaacact ggaaatatgg acagaaatca gatccctgcc ttcatttttc 194640 
tgccttttac aaagagactg gagggaattc agaaactatt taaaataaag gcaaaatgat 194700 
tagagcccct ccctcccctc agctgcttaa cactggggtt gtggtggacg caaaataagc 194760 
attgagctct aagtgataga tgagaatcag aacaggaaca gtgtttttga ggcaaaatat 194820 
gtccaagaga attcaaagaa ctgtgggcca gaatctactt aggcag.tcct ctgggacccg 194880 
aatccctcac aggcgttaac agtggaacca atttccaagg cagccctgct ggtgatctga 194940 
tttttgagta gggaaatctg ttaaacatcg tcccacgagg gagcccagct ctttcactcc 195000 
ccacgggttt ctacatgcag ctgtgctaga tctgctgaag tggccggtga ggaggtgtgg 195060 
ggattggttc agcgacctca gaggacattc ttgttcacta gccctcgtgc actggggcga 195120 
tgaccgaatg ctgtgagcag gagatatcaa aggccggcta ctggactgaa aactagatca 195180 
ccatctctaa cctgcaattt gtcaatctca gacagcaatg aagactgtga ttttctagtc 195240 
aacgctttgt aagcaaggtc agatagaggc tccataaaaa ttgttcaggg ttcaggcaga 195300 
gaatcaagtg taactcaatc cctatctcct gagattaggg aagggaagga aggctgtgtc 195360 
tactaaacca gtgagcctca agcaaagcct gtctgttctc agcaaggtga gccacccacc 195420 
aaagatgcca acagctaagg gccagggatg tagtgcaggg tgctgtgata tcaacagctg 195480 
ggagacagaa acaggaggat caggacttca aggtagtttg ggctataaaa tataagcttg 195540 
aagctaccca cttgaagact gtccccaaca aaacaaacaa gctgggtatg gtggtcgatg 195600 
cttgtatttc tagtgtatga gacgaaggaa gaaagctcaa gcccgagacc tgcttgggtt 195660 
acatagggaa gatggtgcct caaaaacagg acagccgagg agcagacaga cagggcagac 195720 
agggtgcatc gatctaaatc cacatacctg gatttaaagt aatatctggg agactggtct 195780 
gtgagggccg ttccagagat ttaataaaga cccaccctga ctgagtatgg gcaacaccca 195840 
tgggtggccc aggggtccag actgaataaa gggaaaatgg ‘ gaggaagttc agctggtagt 195900 
gtttccaagt attaggacca cagcctggcc cctggcatgt gctggccagc tagtctagct 195960 
ccagtatcaa gcttcaggcc agcggcaggg cactggacag ttcccacaca cgacacacac 196020 
acacacagag cactagcatt cacctcctgg tctcttcttg acaacagata aaatgtaact 196080 
ggctgccaca gtgagagtcc cctaccttcc tcaccgttaa ggatggtaca aactgtgagc 196140 
cagcagcagc catttctcca gtaacttgct ttccacagat actgttatag cactaagaaa 196200 
agcaactgaa acatggggtg ctgtgacccc ttggcaccac aaagccatgg caagctgaag 1 96260 
tgcacacatc acaggccagg cctgaagatg ctgggggact gcaatgctgc ctggattctg 196320 
gcagagatgt gcagcagatg ccaagaggtg ggctgcagca accagagata attaatatga 196380 
ttaggaacac actgagcagg catgctcttg ccgaatgaaa agcctcgcag tgtaatgact 196440 
gttttctt cc tcgatcacgg tctccacgtt tcagagttgg cttggtgtta ggctgccgcg 196500 
taaacatcaa tccaaccccg aggggccaga tcatcggtgt tcctgggctc aatcgccttt 196560 
ccttttgtgt tttcattcat ttaaagatgc attccagggt tgcaaacatt agtgagaatc 196620 
atctccaggc ctcagtctaa tctctgagtc tgtaatgagt taacatcttt ccctagtgaa 196680 
tatttattat gaaggctaat taattgcttt ccagttacaa gaatccttta cagtcaaaga 196740 
aagtaggatc cacaaagata tactgtttat tcaaacaaag caaaggaaac aaagcttctt 196800 
tcttaaattc tatttaacat agctttaata aaggtacaca ggtccgcctg gcaaccgaac 196860 
ggtaactgat gcaaactgaa gccatgctct gtagcagcct ggatgtccca gtgccacctc 196920 
tgtctgcagg ctttgtcgga tttactaaga ttctgttatc ttcaaacagg gattgtgtct 196980 
caagtaactg accccactat gtggataatg aagtaaatta tgcaatttgg gggtttgctt 197040 
ttccccaagg ggacagcaag ccagtgctta tcagccgtcc tcagaggaga caattctgat 197100 
taatatcaga gtcatctgac tcagtctatt aaacctatca aaccctgaag gaaggatatt 197160 
cagatattaa cgataggcct ttgattaata attctacctt gttgccattc taagcattaa 197220 
caaccatgca gtaactctgc aaaacagacc ctttgattcc aggcagacgc accctctgaa 197280 
cacctgggtt ctcccctact cttctccccc caggaggaac tcaagacaaa aaggtgccac 197340 
cactggaaaa gcacactcca ggttacataa tttgcctcat tatccagagt ggggttaatg 197400 
acttgtgaca taatttctgt ttgaagataa caaaatttca tgaaatccga caaagccgga 197460 
aggcaggagg aggggactgc tgccacacta ccggtggctg agaactggag cggaaggttc 197520 
acacacagcc ctctgagctc actgtctttg cttatcagtg agtcccaaga ggggcccaga 197580 
tgggttgcca gcctccccta gaggatcttc attgtggagc tgtcccatgg ggcgggaagg 197640 
aagccattct atttctgttc ttctctcttc cgttctggcc accagtggta cttgctccca 197700 
tcacatgttc ttcctgatgt tcgcgatcag ccgtctgcca tagtctctga agtccacggg 197760 
cttcacgtcc atcacagtgg ccttaattcg agattcatcc tttagaaaag agagaagctg 197820 
tttgtgagtg gcagagcctg gcgtgcagcg gaagagagaa ctttctttgc ttcagtggct 197880 
tcaatgagtc cagcaggaag aaggaaagtt tacaagtctc agagagaaag tgctgtgact 197940 
tcctggagtt gggccagatc ctcttccaca gaccctttcc ccatcctagg tgccctgtgc 198000 
tcagacctag catcctcccg gagaagcctc tgtctttcta tgggtgcagt gggggcccag 198060 
agcagacagg taactcaccc taaagcatca ctttcatcta gaggagctct gtggtagtag 198120 
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ggactgaggc 

tccatcccag 

ccagcctacc 

ggtgataacg 

cacttgtctc 

cacaagtcta 

ccgctgttta 

ctccatggct 

aagcttgcaa 

ctagatagat 

ctggacatca 

tttgtttaat 

annnnnnnnn 

nnnnnnnnnn 

ctcctgctgg 

cacagtgtca 

gaactcgagt 

gattccctgg 

tgtaccgatg 

gagaacagca 

gagggggggt 

aagttccaca 

agaaacagat 

ttttctcctt 

tatatctaag 

cattatggat 

agaacagtca 

tgagttaggt 

caatgatact 

ttgctggtct 

gttatgtacc 

gtttctctgg 

ctcagaaatc 

cccagtttaa 

ctggatgttc 

cctctgcctc 

ttttttaatg 

cccagagcgg 

ttttaaaggc 

cacacccgta 

tgctggtctg 

cagtttgagc 

gcagagaagc 

aacaggaaat 

cttctctact 

aaattaaact 

gcctggttct 

gcctcactag 

tggcctgttc 

cagaaccaaa 

ctcttcagca 

acgtgcggtt 

tggagctcag 

tgcggagggg 

ctgcacccta 

cactgcgagg 

aatggcccag 

cccttcctca 

tgatgcccgg 

gcgttggctg 

gggtgaagca 

gactcgagtg 

cgtgcccttt 

ggctgcccag 


ttctgctcca 

gaagacagaa 

tatgttcatt 

cagaatgcac 

cttgaactaa 

gctgggggcc 

caagtatggt 

ttaggttcag 

caacagctcc 

ggggcaggac 

ggattagttt 

attctgagtt 

nnnnnnnnnn 

nnnnnnnnnn 

ggttacaggc 

agctggatat 

tggtgaaata 

tcacatgctt 

catgcttgca 

agtgaacaaa 

gtaaagctct 

tgtaaggtca 

gcagtctgtc 

cctttctttt 

tacactgtag 

ggttgtgagc 

gtgctcttaa 

tttgtgtagc 

ctgcctgatc 

ggaactaaca 

accacacctg 

gtagtcctgg 

cgcctgcctc 

gggttttttt 

tggaactcac 

ccaagtgctg 

tgtgtatttg 

aagtgttctt 

accaaactct 

cacacccgta 

gcctgtgtat 

acaggctctc 

atctgcaagg 

ctcaggagag 

tcccagaaca 

ccttggtttg 

gctgctggca 

atccgcgtct 

aggatgatct 

cgagcctcgg 

ggggtgggga 

gtttccgtgt 

cgctggtgct 

taagaaaacc 

ccccccgctc 

a gggccgtgg 

gggagcagaa 

ggacctgaca 

cagctggccc 

gctagtctcc 

gaagtcagga 

gaggtgcatg 

gaggaggggt 

aattaccgga 


gctctgggca 

aatccctaca 

taatacaaca 

aaactctgcg 

acagtgtctt 

ttcatgcttc 

taggagcccg 

agtcattctc 

aacacagaaa 

ggactgtcca 

aaacacttgt 

ttagaaccta 

nnnnnnnnnn 

nnnnnnnnnn 

ttgagccacc 

gatggcacat 

gtccaccagg 

taaggaaaga 

gccacccaca 

caaacaagcc 

tggcaggtgg 

tgtgatcgag 

atatctaaac 

ttctctccct 

ctgtcttcaa 

catcatgtgg 

ccgctgagcc 

cctgggtggc 

tgtcttaatc 

gagatctgcc 

gagtttaagg 

ctgtcctgga 

tgcctcccaa 

ttgtttgttt 

tctgtagagc 

ggattaaagg 

tatgggtgtg 

aaccgctgaa 

taggtaggag 

cacaccacat 

gagctggaac 

tggtttctgt 

ctgtgaccac 

tatttccttt 

ttgtgtgcca 

cagagtagca 

gagctgacac 

cagagtctgc 

tcctctgcat 

acgaagcccc 

caagagcaag 

ctgggagatc 

tcgcgctccc 

cccaggcttt 

ctcgttcatc 

ccaggctcag 

ggcggaaagt 

tggaggagct 

tggcgccggg 

gggagcgccg 

ctttactgca 

ggctgcggaa 

cctgtagccc 

acctgtggcg 


aggttacttc 

ctctcccttg 

actaaaatat 

gctgcagggg 

tgaggcagaa 

catgtgctta 

actgcctggg 

tgcatgcctc 

gtgctgtgag 

gtaagcaggg 

caggtggggc 

aactgtggga 

nnnnnnnnnn 

nnnnnnnnnn 

atgacagctt 

gcttgcaata 

taaaagagct 

acttgccgaa 

cacccacaca 

gggggggggg 

ctggctcctg 

tacatctggg 

cattgttgtc 

ttctctttta 

acacaccaga 

ttgctgggat 

atctctccat 

gttaccttaa 

attttgagat 

tgtttctgcc 

gttttttgtt 

actcgctctg 

gtgctgggat 

tttcctgaga 

aggttagcct 
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gtctctaaat 

tagtcccagc 

atgccaggtg 

cagggggttc 

acacacacga 

catgttagta 

tttatttttt 

atcgcaacac 

acagagcaag 

acaaaatatt 

ggctgtcctg 

tgcttctgcc 

ttatagattt 

aggggtciaga 

ccatgctggg 

ctagagtttt 

tttcgagaca 

gctggcctcg 

gtgcgccacc 

ttgttttgtt 


tctacgtatc 
taataaccca 
ggtagatagc 
gacatcat cc 
ctggaaagca 
gccggccctg 
acctttggga 
cctgggccac 
gaagacaacg 
ctggcgtact 
cagatccgcc 
agcagtgaag 
tacaactatg 
aaaagcgtct 
gcagtgcacg 
ctgctttact 
gcgtattttg 
tctgtgggac 
caggcgtctg 
tggcttcttg 
tcagggcaga 
tggggctgga 
ttttctgagg 
ccagggctct 
gcacagtgga 
cacacaccag 
agccaggctt 
agtcaggaaa 
tgggtggggc 
gcctactggt 
atggcgacgg 
gctgaaccct 
tggcttct cc 
ctggccagcc 
tcctttcatt 
ataactactc 
ccagctgcac 
cacagcgatg 
tcacgccctg 
gagaccactg 
gcgcaggcct 
caggttctgg 
ctctcctccc 
caggctgccc 
cccaacattc 
tctcggagtc 
ttctttgaga 
aacacccttt 
acacaaataa 
cgattggatg 
ggtcttaaaa 
tcaggagcag 
tttcagggca 
taaaagattt 
gaactcactg 
tcctaagtgc 
ggtgttttgc 
ggcgagtatc 
actgggtcat 
tttcccccct 
gggtttctct 
aactcagaaa 
acgcctggct 
tttttctttt 


gataaaggaa 

gagtagccca 

taccttgcca 

ggaaaatcat 

tgagcccgag 

aatgggaccc 

gtgtcctggt 

tgggctccct 

ccctgaagcg 

ggcagtacga 

tgcagagctt 

acttctttct 

gggacgggac 

tctacgtggg 

tctgggaccc 

gtgccttagc 

acggggaaga 

ccaagtttga 

ccttctacct 

tacgacgtca 

cggaggtcgc 

cccttgactt 

gtcactgggg 

ttgatagcca 

tccttcagac 

catcaccatg 

gcaggtcagg 

gacactcagg 

gctggcacgg 

ttcgtgggcc 

gcacttgaac 

gggcttgttc 

tcgggcttca 

catgaagggg 

tctacttagg 

ggggacattc 

aggaggcagg 

ttctaatgag 

tccgtgaccc 

agcagagtcc 

gcgcatcgtg 

gaggggagaa 

ccgccgtcca 

ccagctctat 

taagaaagtg 

ttctctgagc 

ggacctgagt 

atggcttctg 

aaataaatgt 

tggcagtgcc 

ttttaaaaga 

aggcaggtgg 

gccaaggtta 

acttacttct 

tatagaccag 

tgagattaaa 

ctgcttgtgt 

agctcttcct 

ttgcaagagt 

ggctgtcctg 

acataggcct 

tccgcctgcc 

cagaatctgt 

ctttttttgt 


gctgttttgg 

gtcagtacct 

tcttccctga 

ccccaaccat 

ctctcgaaac 

tcagagtggg 

tggaaatcgc 

ctctggagtg 

ggagctgcct 

gatcggtgtg 

cccagggcac 

gagtggcagc 

caatgagacg 

ccagcttgag 

cttcacaggt 

caggcctctg 

ttcagtgagg 

atccactccc 

gtaagaacca 

cctgtcgtcg 

agctagttag 

cactgtggaa 

aatgggatgc 

ctaagtcccc 

agccgggtgc 

gccagctccg 

aggggtgcag 

ggactccacc 

atggggcttt 

tgaatggggt 

acaatctcct 

gctcgttggc 

tggtgctcct 

acattctaca 

gcctggtctg 

agtggggtgg 

ctaggggcag 

taacccttgt 

agttcaggtt 

atgccccctg 

ttcctgcctc 

atagggctgc 

gatggaggtg 

ccaggacccc 

tcaggatggc 

accagttctt 

ttggttctca 

tggacacata 

taaaagaaga 

tcacagaatc 

ttggtttagg 

atctctatgg 

cacagagaaa 

tatttgagct 

gctggcacct 

gtagtgttat 

atatatgcac 

ggaactagag 

tacaagtact 

aagtagaatc 

ggctgtcctg 

tctggctctc 

ttttaaatga 

ttttgttttt 


ggggaggaaa 
tgccttggct 
tcttagaact 
gagttggtcg 
ccagccagca 
agctgtctcc 
atccagatcc 
ggtagtagcg 
cggagtgcqc 
agccag.cagg 
acgggggccg 
aaggaccgga 
gcttcccgcc 
gccccgcagt 
gagcgggccc 
ggaacgggac 
caggaagaga 
ccaacttacc 
aaaatttaga 
ttgtaaagag 
agcatgctat 
gcagcaagat 
aggtgtgggg 
agatgtgtcc 
ccctgacggc 
actccactct 
• ttcctgggct 
aggaacgctg 
tctcttccgg 
gagctgcagt 
cctatagcat 
cgtcagcccc 
agatacccgc 
gatcaaggtg 
ggagaggaca 
gaaaatggcc 
gaatcagggc 
ccatatttgt 
aaataaagcc 
ctgggctgtc 
ctatattcaa 
ttgtgggagg 
aaagacaggc 
aggggaccct 
tctggggtca 
ttcctatggg 
gcactgtcta 
ttcttatgtg 
cagcggcacc 
tgtgggactt 
agtggtggta 
gtttgaggcc 
ctctttctca 
aggatctccc 
taaactcacg 
accatgcccc 
taccttcatg 
ttatggaagg 
tctgagccat 
tgttcttgtt 
gaactcactc 
agaatactgg 
gagtaatagt 
tggctcattt 


caggcttaca 

tctgttgttt 

ttccccactc 

gggagctggc 

tggaacccac 

aggacgatgg 

ctgactctca 

gaggcctcag 

atgggctgag 

atgcccactt 

tcaaatgcgt 

ctgtgcgcct 

tcatctatgc 

atgtggtgag 

aggtgaggcc 

ctagtgcgaa 

agaagagtca 

agcaatcggc 

agattccacg 

aagtatcgag 

gtgaagagag 

gagaaagccc 

tgagttggag 

tttttcagga 

tgtggctgtc 

gcgctttgtg 

actgggggtc 

cagtgacagg 

ggatggagtg 

agggtgggtg 

gagttccgac 

agtggccgga 

acgggcctgg 

actgactgcc 

ggtttatgct 

ctcgtaggcc 

tagaactgac 

cttgcttgga 

aggaggctgt 

ctgatggggg 

tcatagacct 

atttctccct 

actgttgctt 

gggtctcagt 

tcctgggtgt 

aagtaaggac 

gctctggctc 

gcctgcacgc 

ttgtacctca 

tatttattta 

cacaccatta 

agcctggtct 

aaaaataaaa 

tattagccct 

aagatcctcc 

actattttct 

cagtgctaat 

ttgggagtca 

ctccagcccc 

ttgttttgtt 

tgtagaccag 

aattaaaggt 

tacaggtttt 

gttttatttg 


209700 

209760 

209820 

209880 

209940 

210000 

210060 

210120 

210180 

210240 

210300 

210360 

210420 

210480 

210540 

210600 

210660 

210720 

210780 

210840 

210900 

210960 

211020 

211080 

211140 

211200 

211260 

211320 

211380 

211440 

211500 

211560 

211620 

211680 

211740 

211800 

211860 

211920 

211980 

212040 

212100 

212160 

212220 

212280 

212340 

212400 

212460 

212520 

212580 

212640 

212700 

212760 

212820 

212880 

212940 

213000 

213060 

213120 

213180 

213240 

213300 

213360 

213420 

213480 
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ttttgagaca 

ctgactttaa 

tctgatggtt 

atatcaccca 

cacagagatc 

ggtgagtacc 

gggcagcgtg 

acagaagccc 

cggcagcgag 

gccaccctct 

tctggctttg 

cctcctggca 

ggcccgctcc 

tacccacatg 

gcttcatgaa 

tgtgatcccc 

aggaaatgaa 

ctctgtttct 

ccaggctcag 

attcattcag 

gggaaggggt 

cctaaccaca 

ggacaggagg 

tctgccatct 

actagttgca 

tgaggctgtg 

ccaaggagct 

tggcttggct 

ccttcctgct 

ttttttaaag 

caccatactt 

cgaagttaat 

gcaagagcag 

ttgatacaag 

cctacctacc 

catgcccacc 

aaacacagca 

tgggttgggt 

gctgaagagg 

tggaacctag 

gctcctgcag 

tggtggtggt 


ggtctcactc 

acttgttttc 

gttaataaca 

gacagtggct 

tgcctcccga 

caagcctcca 

ctcatcagct 

acgcaccact 

gtggtcaccg 

caggccacca 

ctgcccacga 

tagggccagc 

tcattcttga 

gcctgctgtc 

actcgggaga 

attgctcccc 

aggagttggg 

cccttgggta 

gagggctggc 

ccaggtcttc 

tatatgggcc 

gcgtttgtgc 

cagcgggaat 

cttgagtgtt 

gtctatgtaa 

ttgtgttttg 

ggggaactgg 

ttttttcaga 

tgaaacagag 

gcattgattt 

gtgtctggtc 

gaatggttat 

ccagtgttct 

gtctggtagc 

ctcttctctc 

agtttgcctt 

agagggtaga 

ggccttgggt 

tgggaccctg 

gctggaggga 

cctccccttc 

ggtggtggtg 


tgaaccccta 

tgagtgctgg 

atattattta 

ggtctggaaa 

gttcagaaat 

ttgatgctat 

cctcttccga 

acaagtcagc 

gcactgtagc 

caaagctcag 

aacgccacct 

caggagttgg 

tctcgaagcc 

tgggattgca 

tactgttcct 

agttctgcct 

cctttgctca 

ggaaactgct 

caggacctac 

agtatgggac 

cagctggtcc 

cttcagccgg 

catctctgta 

cgtttggaga 

ataaggtcaa 

ctttgcctag 

tatatgtcac 

tggtctcaag 

tagtgggctt 

ttatacgtgt 

cctgcggagg 

gagccacatc 

tacccactga 

tcaaacttga 

taccttccaa 

gatggcacca 

ggcagccatt 

cagttttccg 

ggagggccgg 

gccaaccagg 

ccttgggtag 

gtgggggggg 


ggtggc c tgg 

atttatgggc 

gcttttcttt 

tcactgtgta 

taaaagcact 

ctgtccctcc 

ccattccttg 

gtccgaccca 

caacaagatt 

ttccgagaac 

cctgctgggc 

ctgagggcag 

gattcttcta 

cagctcctga 

aaccagcaag 

tctggattca 

gccagcttcc 

cctgtctagg 

gactttcagg 

cagaaaaaag 

aggaatgaat 

ggaggcagcc 

tctcgggttc 

agcctgagat 

taaattcttt 

gctgggctca 

tgtatatgtc 

tgcctcaggt 

ataggcatga 

atggttgttt 

tcagaagagg 

tcgatgctga 

gccatctcta 

tatgcagccg 

gtgctgatat 

gagtcaggaa 

gtcctctggc 

aatggttcac 

cctccagctg 

atcctggact 

ctgtggtggt 

gggngnngnt 


agcttgctat 

ttgtgctatt 

tggagatagg 

ggccaggctg 

ctgggatggt 

cgctctctgc 

actgtttgga 

atccacacct 

ggtgtctgtt 

ttccgtggca 

tcggacaatg 

ggcgagatga 

ggcaagcccc 

atctccaaag 

aattggggca 

catggggaca 

tctagccacg 

gttctgatgg 

gcttggtctg 

gggatgtgag 

ccatgccttg 

cttgggacca 

tgcccagggg 

gtggcccctg 

ggaagagcca 

ggcagctctg 

actgtgcctg 

tggccttgat 

cccaccaggt 

tgcccacttg 

gcatcgggat 

agattgaacc 

agccccacac 

aggaggttga 

tatacatagg 

agtccaaacc 

tgccttggat 

ccttggggaa 

ggtctctcca 

tgctacagtt 

ggtggtggtg 


gtagaacaca 

ttgcccagcc 

ctctcactgt 

accttgaatt 

tttggagttt 

aggctgtaga 

aggagctgga 

ttgacctgta 

ccctgcttga 

cgctcactag 

gcatcatccg 

catctctcag 

aggctctggc 

ccttgaagtg 

aggaaagcac 

gggcagctcc 

ctctccttag 

tactgggact 

gggttttagc 

aacagggcta 

ccttggtacc 

gcatccctag 

atgggcagac 

ctgccttctc 

cggagctgag 

cctcagcctc 

gcttatggct 

cttgggatga 

ccaattttta 

tacatatgca 

cacctggaac 

tggatccttt 

ccagcttctt 

cctggtattc 

catggatagt 

tggtagttgc 

acagagcttc 

agggaacact 

gccctcgcct 

aggtgaacag 

gtggtggtgg 


213540 

213600 

213660 

213720 

213780 

213840 

213900 

213960 

214020 

214080 

214140 

214200 

214260 

214320 

214380 

214440 

214500 

214560 

214620 

214680 

214740 

214800 

214860 

214920 

214980 

215040 

215100 

215160 

215220 

215280 

215340 

215400 

215460 

215520 

215580 

215640 

215700 

215760 

215820 

215880 

215940 

215980 


<210> 17 
<211> 473 
<212> PRT 
<213> Mu s sp . 

<4 00> 17 


Met 

1 

Lys 

Arg 

Ala 

Ser 

5 

Ser 

Gly 

Gly 

Ser 

Arg 

10 

Leu 

Leu 

Ala 

Trp 

Val 

15 

Leu 

Trp 

Leu 

Gin 

Ala 

20 

Trp 

Arg 

Val 

Ala 

Thr 

25 

Pro 

Cys 

Pro 

Gly 

Ala 

30 

Cys 

Val 

Cys 

Tyr 

Asn 

35 

Glu 

Pro 

Lys 

Val 

Thr 

40 

Thr 

Ser 

Cys 

Pro 

Gin 

45 

Gin 

Gly 

Leu 

Gin 

Ala 

50 

Val 

Pro 

Thr 

Gly 

lie 

55 

Pro 

Ala 

Ser 

Ser 

Gin 

60 

Arg 

lie 

Phe 

Leu 

His 

65 

Gly Asn 

Arg 

lie 

Ser 

70 

His 

Val 

Pro 

Ala 

Ala 

75 

Ser 

Phe 

Gin 

Ser 

Cys 

80 
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Arg 

Asn 

Leu 

Thr 

lie 

85 

Leu 

Trp 

Leu 

His 

Ser 

90 

Asn 

Ala 

Leu 

Ala 

Arg 

95 

lie 

Asp 

Ala 

Ala 

Ala 

100 

Phe 

Thr 

Gly 

Leu 

Thr 

105 

Leu 

Leu 

Glu 

Gin 

Leu 

110 

Asp 

Leu 

Ser 

Asp 

Asn 

115 

Ala 

Gin 

Leu 

His 

Val 

120 

Val 

Asp 

Pro 

Thr 

Thr 

125 

Phe 

His 

Gly 

Leu 

Gly 

130 

His 

Leu 

His 

Thr 

Leu 

135 

His 

Leu 

Asp 

Arg 

Cys 

140 

Gly 

Leu 

Arg 

Glu 

Leu 

145 

Gly 

Pro 

Gly 

Leu 

Phe 

150 

Arg 

Gly 

Leu 

Ala 

Ala 

155 

Leu 

Gin 

Tyr 

Leu 

Tyr 

160 

Leu 

Gin 

Asp 

Asn 

Asn 

165 

Leu 

Gin 

Ala 

Leu 

Pro 

170 

Asp 

Asn 

Thr 

Phe 

Arg Asp 
175 

Leu 

Gly 

Asn 

Leu 

180 

Thr 

His 

Leu 

Phe 

Leu 

185 

His 

Gly Asn 

Arg 

lie 

190 

Pro 

Ser 

Val 

Pro 

Glu 

195 

His 

Ala 

Phe 

Arg 

Gly 

200 

Leu 

His 

Ser 

Leu 

Asp 

205 

Arg 

Leu 

Leu 

Leu 

His 

210 

Gin 

Asn 

His 

Val 

Ala 

215 

Arg 

Val 

His 

Pro 

His 

220 

Ala 

Phe 

Arg 

Asp 

Leu 

225 

Gly Arg 

Leu 

Met 

Thr 

230 

Leu 

Tyr 

Leu 

Phe 

Ala 

235 

Asn 

Asn 

Leu 

Ser 

Met 

240 

Leu 

Pro 

Ala 

Glu 

Val 

245 

Leu 

Met 

Pro 

Leu 

Arg 

250 

Ser 

Leu 

Gin 

Tyr 

Leu 

255 

Arg 

Leu 

Asn 

Asp 

Asn 

260 

Pro 

Trp 

Val 

Cys 

Asp 

265 

Cys 

Arg 

Ala 

Arg 

Pro 

270 

Leu 

Trp 

Ala 

Trp 

Leu 

275 

Gin 

Lys 

Phe 

Arg 

Gly 

280 

Ser 

Ser 

Ser 

Glu 

Val 

285 

Pro 

Cys 

Asn 

Leu 

Pro 

290 

Gin 

Arg 

Leu 

Ala 

Asp 

295 

Arg Asp 

Leu 

Lys 

Arg 

300 

Leu 

Ala 

Ala 

Ser 

Asp 

305 

Leu 

Glu 

Gly 

Cys 

Ala 

310 

Val 

Ala 

Ser 

Gly 

Pro 

315 

Phe 

Arg 

Pro 

lie 

Gin 

320 

Thr 

Ser 

Gin 

Leu 

Thr 

325 

Asp 

Glu 

Glu 

Leu 

Leu 

330 

Ser 

Leu 

Pro 

Lys 

Cys 

335 

Cys 

Gin 

Pro 

Asp 

Ala 

340 

Ala 

Asp 

Lys 

Ala 

Ser 

345 

Val 

Leu 

Glu 

Pro 

Gly Arg 
350 

Pro 

Ala 

Ser 

Ala 

355 

Gly 

Asn 

Ala 

Leu 

Lys 

360 

Gly Arg 

Val 

Pro 

Pro 

365 

Gly Asp 

Thr 

Pro 

Pro 

370 

Gly 

Asn 

Gly 

Ser 

Gly 

375 

Pro 

Arg 

His 

He 

Asn 

380 

Asp 

Ser 

Pro 

Phe 

Gly 

385 

Thr 

Leu 

Pro 

Ser 

Ser 

390 

Ala 

Glu 

Pro 

Pro 

Leu 

395 

Thr 

Ala 

Leu 

Arg 

Pro 

400 

Gly 

Gly 

Ser 

Glu 

Pro 

405 

Pro 

Gly 

Leu 

Pro 

Thr 

410 

Thr 

Gly 

Pro 

Arg Arg Arg 
415 

Pro 

Gly 

Cys 

Ser 

Arg 

Lys 

Asn 

Arg 

Thr 

Arg 

Ser 

His 

Cys 

Arg 

Leu 

Gly 
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420 425 430 

Gin Ala Gly Ser Gly Ala Ser Gly Thr Gly Asp Ala Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ala Leu Ala Cys Ser Leu Ala Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 470 
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